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1. Introduction 

The purpose of this proposal is to provide insight into gene environment interactions. It leverages the simplified 
genetics and detailed records of the military working dog population. There are several critical aspects to meeting 
the aims of this proposal. 1) development of data driven selection criteria, 2) biological sampling of representative 
dogs, and 3) generation of mathematical methodologies capable of handling heterogenous data and statistical tests 
in consistent manner and providing clear and understandable results that are biologically valid. Here we provide a 
breakdown of the previous year’s work and document our progress towards achieving the specific aims we 
proposed. 

2. A. Keywords 

Canine, cancer, genetics, genomics, epigenetics, gene-gene interaction, gene-environment interaction, risk, 
longitudinal cohort 

B. Abbreviations and acronyms 

Military Working Dogs (MWDs), Principal Components Analysis (PCA), Multidimensional scaling (MDS), Joint 
Pathology Center (JPC), Armed Forces Institute of Pathology (AFIP), Animal Care and Use Review Office 
(ACURO), Institutional Animal Care and Use Committee (IACUC), Cooperative Research and Development 
Agreement (CRADA) 

3. Accomplishments 

A. What were the major goals of the project? 

The proposal includes technical tasks to make the research possible, but these are not the major goals of the 
project. Because the present funded-project is first-of-its-kind in several respects, its research goals 
encompass more than the top single aim of understanding genetic and genetic/environmental risk factors for 
cancer in the MWDs. The main additional research goals are to establish the ideal analyses to be used and to 
understand their biostatistical properties and performance. The full goals have three elements: 

i. Creation of technological infrastructure 

ii. Selection, development and validation of the most ideal analyses 

iii. Execution of the final study of genetic and gene-environment risk factors for cancer in MWDs 

The first thus classifies as technical implementation whereas the second and third are research. 

B. What was accomplished under these goals 

i. Creation of technological infrastructure 

Task 1: Regulatory approval: regulatory approvals (IACUC, ACURO) and institutional agreements 
(CRADA) were received and executed, respectively. In this reporting period, an IACUC protocol set 
up exclusively to scan health care records (but involving no handling of animals) expired, and there 
has not been any scanning of records since then. This does not affect the separate current IACUC 
protocol for collection of biological samples. A new IACUC protocol to replace that for records 
scanning is pending. 

Task 2: Data capture of veterinary records: This task was directed by PI Alvarez, with support in 
the area of clinical data/health care records from PI Kisseberth. 

Task 2a) As planned, project manager and project programmer traveled to Lackland AFB, San 
Antonio, TX, to visit Military Working Dog (MWD) veterinary hospital in order to acquire 
information on data types, security, etc. and to establish protocols for data acquisition, scanning and 
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transfer between Lackland AFB and The Research Institute at Nationwide Children’s Hospital 
(NCHRI), Columbus, OH. 2b) As planned, regular communication has been maintained between 
NCHRI investigators and our veterinary technician at Lackland AFB. 2c) Our Lackland AFB 
veterinary technician was originally planned to receive training on molecular methods so she could 
process the blood samples at Lackland AFB. This plan was deemed too risky to the quality of the 
biological samples and was changed to shipping fresh blood samples for processing at NCHRI. 2d) 
We delivered to our veterinary technician at Lackland AFB and validated the computer and scanner to 
scan paper records into digital data for transfer to the DAPER database at NCHRI. NCHRI staff 
traveled to Lackland AFT to establish computational infrastructure as planned, but did not need to 
establish molecular infrastructure as originally planned (see 2c). 2e) Ms. Perez and LTC Richard 
reviewed the available breeding program dogs available at Lackland AFB and meeting the study 
criteria. Drs. Kisseberth and Alvarez were also involved in this process. 2f) Drs. Alvarez, Kisseberth 
and Zapata have evaluated possible exposures and biologically relevant events, and are prepared to 
focus such analysis more in depth in the final selected dogs. 2g) Dr. Huang’s programmer 
successfully created the DAPER database in which to collect the MWD health records. They 
completed the introduction of text-to-digital conversion software (ABBYY), and implemented custom 
forms for that conversion for one form and initiated that for pathology reports. In this past year, this 
digital-conversion work was put on hold to focus on more urgent needs for the project. It is 
anticipated that the necessary data capture can be done manually once the final set of dogs is chosen. 
To date, 257 veterinary medical records have been scanned, including for the dogs for which 
biological samples have been collected. An additional approximately 1,000 dogs have been identified 
that could serve for epidemiological analysis together with the 257 records already acquired. 2h) Dr. 
Alvarez and Ms. Perez have continued evaluating progress and protocol compliance, in 
communication with the relevant Lackland AFB personnel. 2i) Statistical analysis has continued, led 
by Drs. Zapata (mainly) and Alvarez. 2j) Dr. Alvarez continued to supervise Ms. Perez in data 
handling, and initiated training of a new data handling assistant (Mr. I. Wolfe, Ohio State U. student; 
currently in unpaid volunteer status with academic credit for research). 

Task 4 Identification, recruitment, and retention of cancer bearing and control dogs: This task 
was directed by PI Alvarez, with support in the area of clinical data/health care records from PI 
Kisseberth. Original Tasks 4a, b, c, d, e and f all refer in part to non-inventory dogs at Lackland 
AFB. In our Year 2 Annual Report, we explained that MWDs from outside Lackland AFB were 
expected to not be necessary because acquisition of dogs with different environmental exposures was 
possible at MWD program at Lackland AFB and the extended MWD cohort housed at the Medina 
Annex, both in San Antonio. 

This task is well advanced, especially in the sense of acquisition of health care records from dogs 
prioritized by being cancer positive, high-cancer risk by family occurrence or of advanced age. The 
MWD subjects were selected from the full MWD population of Belgian Malinois. To date, 32 MWD 
blood samples were collected at Lackland AFB and shipped to Dr. Alvarez’s lab (NCHRI, Columbus, 
OH). Collection is ongoing. 

Original Tasks 4a, b, d, e and f are technical, regarding setting up infrastructure and protocols. 
Original Task 4c refers to identification and ascertainment of dogs for the present study. A major 
advance in this area was the acquisition of the list of dogs for which there are pathology reports (i.e., 
processed by Joint Pathology Center (JPC)/Armed Forces Institute of Pathology (AFIP); those dogs 
are highly enriched for cancer-positive status. We were thus able to directly acquire the health records 
of those dogs, scan them and input to our database. Those records are for dogs that are not available 
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for collection of biological samples at Lackland AFB. They are useful for epidemiological 
information relevant to cancer in the MWD population, and to ultimately analyze together with that 
for the dogs used in the planned genetic studies. 

Task 5: Molecular Characterization of cancer bearing and control dogs: Original Tasks 5a, b, c, 

d, and e have two aspects - 1) molecular-technological development/implementation and validation, 
and 2) execution of the final molecular analysis once the full set of samples are collected. Notably, 
the latter cannot be done as the samples are collected in order to avoid experimental variability and 
batch effects. 

Detailed description and discussion of this task was provided in the first and second year annual 
reports. Those studies addressed implementation and validation of most molecular methodologies that 
we will use in this project. That included the critical molecular analyses that will be done first when 
all the biological samples have been collected. Most importantly, that includes the isolation and QC 
of high-quality DNA for SNP genotyping and DNA methylation analysis. We have also established 
SNP genotyping expertise and development of sequencing-based marker genotype validation and 
targeted DNA methylation assays that will be necessary following the GWA and DNA methylation 
analyses. The only development in this area in this reporting period was the collection of the majority 
of the biological samples acquired to date (current n=32). 

Task 6 - Adaptation of existing resources, data storage and hosting: This Task was directed by 
both Pi’s Huang and Alvarez, and the work was done by programmer J. Aaronson (who reported to PI 
Huang) working closely with PI Alvarez. Tasks 6a, b, c and d are technical - specifically listing 
each element of functionality to be added to the database of MWD health records and molecular data. 
In the two previous annual reports, we described the development of the virtual database server for 
this project and the implementation of software to convert paper health care records to digital. This 
reporting period we continued to import health care records data and to maintain the database. 

Task 8- Project management, Quality control and assurance, and Security: This task continues 
smoothly and without significant change as reported in the previous annual reports. 

ii. Research goals: Selection, development and validation of the most ideal analyses (for execution of 
the final study of genetic and gene-environment risk factors for cancer in MWDs) - i.e. when the 
sample collection and molecular and statistical analyses are performed) 

Task 3: “Development of Principal Components (PCs) and pedigrees” Task 3 is advanced to the 
point it can be implemented once the final experimental MWD genotype data and familial 
relationship documentation are completed for the selected dogs. In this reporting period, we have 
begun collecting biological samples. These samples will be processed and will put through a pipeline 
of the methods we have developed in previous years. 

Original Tasks 3a, c, d: Principal Components Analysis (PCA) and pedigrees (3b was a related 
technical task, adding pedigree capabilities to MWD database): These all refer to methods and 
experimental design for understanding animal relatedness and population structure. This issue is 
unique to dog breeds and in particular to the MWDs because they are unlike other canine cohorts in 
their “evohitionary’Vhuman-purposed selection/breeding (on top of the fact that dog breed population 
genetics are different than human and mouse, and are not uniform and generalizable because different 
breeds have vastly different population-genetic characteristics. 
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In the present reporting period we further developed our expertise in this area by analyzing the 
new dataset of Golden Retrievers with or without one of two cancer types (B cell lymphoma and 
hemangiosarcoma) from Tonomura et al. PMID: 25642983.This work is a good proxy for the present 
study because it i) represents multiple cancer types in one breed, and ii) it requires addressing 
population structure within an single breed. Included in these studies, we conducted GWA for each 
cancer separately, and both together (Appendix 1). We also formalized aspects of our understanding 
of canine breed genetics in the study design for an NIH R01 application that we submitted within the 
first quarter of this reporting period (this was included in the previous, year 3, annual report as 
Appendix 11). 

Original Task 3e: Development for publication of technical and analytical novelties/ strengths 
of the current project for dissecting complex genetics and gene-environment interactions: In this 
reporting period, we continued to optimize and validate the biostatistical methods to determine 
genomewide significance of association. This is extremely important because it relates to the main 
challenges in complex genetics: mainly, multiple testing correction (and its associated loss of power 
to detect all associations but those of large effect sizes) and latent variables (e.g., population structure 
- addressed in previous section - and its resulting high false positive rate). 

In this reporting period, we continued to formalize and optimize our methods developed in 
previous years - including the use of genomewide Intersection Union Testing association (or GIA). 

To do this, we expanded the types of populations analyzed - adding genotype datasets from the 
human HapMap resource (http://hapmap.ncbi.nlm.nih.gov/index.html.en) and from the largest dog 
genetics study to date (915 dogs from 80 breeds; Boyko et al., PMID: 20711490). We improved our 
ability to implement our analyses by developing new computer scripts that automate the analysis 
pipeline. We also developed and successfully implemented an analysis that is similar to versions of 
“identity by descent mapping” that have previously been used in human studies (manuscript #2 in 
bullet list below) 

In the previous reporting period, we initiated epistasis analysis. That was continued in the present 
reporting period (Appendix 2, Figs. 1, 2). In addition, we developed new concepts and extensive 
statistical analyses regarding gene-environment interactions, and dog traits that could be associated 
with cancer risk (e.g., metabolic rate, lifespan and size). As we began discussing in the Q1 quarterly 
report and expanded on in later quarterly reports, new high impact publication suggests novel ways to 
think about the genetic and environmental contributions to cancer risk (Tomasetti C. and Vogelstein, 
B. Science 347:78-81; PMID: 25678653). In brief, they classified cancer as comprising (arbitrarily) 
two groups - Deterministic (significantly affected by genetic and environmental factors) and 
Replicative (dependent on stochastic somatic mutation resulting from the number of cell divisions, 
i.e., lifetime number of stem cells and their proliferation rate). The investigators note that in cancer 
epidemiology, “environmental” refers to anything not genetic. The stochastic processes associated 
with tissue development and maintenance are thus grouped with external environmental effects. This 
in turn suggests how those stochastic processes - in effect the total number of stem cells and their 
number of divisions - are likely to be affected by diverse genetic and environmental factors, 
beginning with those that influence size and weight. The information from this cited article can be 
used in this project to i) attempt the classification of military dog cancers into Deterministic or 
Replicative for selection of the best cancer types (most Deterministic), or ii) identify genetic or 
environmental contributions to Replicative cancer potential, such as mapped loci associated with dog 
size and weight (Appendix 3). 

The development of this way of thinking of gene-environment interactions was a major area of 
productivity for our group in this reporting period (Appendices 3-6). Two manuscripts were 
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developed in this period (Appendices 5, 6) and submitted for publication a few months after the end 
of this reporting period. 

Task 7: Pathway analysis and functional characterization. 

Original Task 7a describes how the final analysis will use a blocking design to detect genetic and 
environmental effects once the final molecular genotyping is performed. 7b describes how 
Intersection Union Testing/GIA will be used to analyze the final genetic and epigenetic molecular 
data generated. And 7c and d describes how Kaplan Meyer analysis and Cox proportional hazard 
ratios will be calculated on the final data generated from molecular analysis of MWDs. 

Additionally, the overarching Task name and Tasks 7a, b and d refer to pathways and other broad 
aspects such as bioinformatics and functional categories. As reported in the previous three annual 
reports, this task is advanced to the point that the planned analyses can be directly implemented on 
the final study data. In this reporting period, we continued to conduct broad cancer datamining studies 
that will be relevant to our analyses. These types of analyses are likely to dramatically enhance our 
canine studies, and could serve to support our findings. This is because there is vastly more human 
data available, especially in the area of somatic genetics and biochemical pathway/systems analysis. 

In this reporting period, Dr. Alvarez co-authored two relevant studies. In collaboration with Dr. 

G. Lorch and other OSU colleagues, Dr. Alvarez published a study of canine lung cancer (Clemente - 
Vicario et al. PMID: 26560147; Appendix 7 of this report). Dr. Alvarez’s roles were to i) lead the 
Comparative Genomic Hybridization genetic assays and their analysis (which was done together with 
Dr. Jennie Rowell, OSU College of Nursing and former member of this project as a PhD student in 
Dr. Alvarez’s lab); and ii) to conduct other analyses to determine the cancer relevance of observed 
alterations (mainly by datamining in human cancer genomics resources). In collaboration with Dr. 
Curtis Bird and his trainee, Dr. Alvarez also published an article on mammary cancer: Lutful Kabir, 
Alvarez and Bird, Vet. Sci. 2016, 3(1), 1; doi:10.3390/vetsci3010001 (registering DOI; Appendix 8 of 
this report). In this work, Dr. Alvarez conducted the phylogenetic analysis and other bioinformatics 
and evolutionary analyses. 

In this reporting period, Dr. Alvarez was the communicating author on a comprehensive book 
chapter on canine translational genomics, as well as one of the authors of the introduction to the book 
(accepted for publication in 2016); Partnering PI W. Kisseberth and former project member Jennie 
Rowell are co-authors; note this is a high impact book series with wide circulation: 

Martic-Kehl, M., Festing, M. F. W., Alvarez, C. E., Schubiger, P. A., Introduction to Animal Models 
for Human Cancer: Discovery and Development of Novel Therapeutics; in Methods and 
Principles in Medicinal Chemistry series, Wiley-VCH, In press. (Appendix 9) 

Fenger J. M., Rowell, J. L., Zapata, I., London C. A., Kisseberth W. C., Alvarez, C. E. (2015) Dog 
models of naturally occurring cancer; in Animal Models for Human Cancer: Discovery and 
Development of Novel Therapeutics Methods and Principles; Medicinal Chemistry series, Wiley - 
VCH, In press. (Appendix 10) 

This work includes a section titled “Necessary developments for realizing the potential of canine 
models”, with subsection “Epidemiology, longitudinal cohorts, tissue repositories and integrative 
genomics”. This section brings attention to the approaches behind the present effort and cite this 
study through the DoD CDMRP annual report. That section concludes with the following: “An ideal 
longitudinal cohort should make it possible to gather the following data in electronic form: clinical, 
environmental, pedigree, genetic and molecular phenotypes.” 

C. What opportunities for training and professional development has the project provided? 
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Dr. Isain Zapata has trained as a postdoctoral fellow through this reporting period. He conducted the statistical 
genetic analyses reported here for Task 3. He was first author on two manuscripts developed in this period 
and submitted shortly after (Appendices 5, 6). He also presented posters on that work at two internal events, 
as reported in Q3 and Q4 quarterly report appendices (Appendix 4). 

Dr. Alvarez continues to formally mentor Dr. Jennie Rowell (former project member as PhD student in 
Dr. Alvarez’s lab). She is now a tenure track faculty member in the OSU College of Nursing. In this reporting 
period, Dr. Rowell was second author on the book chapter described in B.ii.Task 7, last paragraph. Dr. 

Alvarez also was a member of a working group for a grant application being developed by Dr. Rowell with 
colleagues in her college. 

D. How were the results disseminated to communities of interest? 

We have begun to draw attention to this project in publications and international presentations (Symposium: 
Cancer Drug Development - Predictability of Animal Experiments, Collegium Helveticum, Zurich, 
Switzerland). That in turn resulted in an invited book chapter below, which is described in B.ii.Task 7, last 
paragraph. 

i. We will draw attention to this project in the following book-series publication that include former project 
member Dr. J. Rowell; Partnering PI Dr. W. Kisseberth; and as communicating author, Lead PI Dr. 
Alvarez: 

Martic-Kehl, M., Festing, M. F. W., Alvarez, C. E., Schubiger, P. A. (2016) Introduction to Animal 
Models for Human Cancer: Discovery and Development of Novel Therapeutics; in Methods and 
Principles in Medicinal Chemistry series, Wiley-VCH, In press. (Appendix 9) 

Fenger J. M., Rowell, J. L., Zapata, I., London C. A., Kisseberth W. C., Alvarez, C. E. (2016) Dog 
models of naturally occurring cancer; in Animal Models for Human Cancer: Discovery and 
Development of Novel Therapeutics; in Methods and Principles in Medicinal Chemistry series, 
Wiley-VCH, In press. (Appendix 10) 

ii. We have also drawn attention to this project at local university presentations: 

OSU Humanities & Cognitive Sciences High School Summer Institute, lecture on behavioral 
and canine genetics (Aug 13, 2015) (Appendix 11) 

OSU VME 6540 Structure & Function of Cells (Cell Biology), Lecture on canine genetics & 
genomics (Oct 5, 2015) (Appendix 12) 

iii. The following articles on canine cancer were published: 

Lutful Kabir, F.M., Alvarez, C.E. and Bird, R.C., Canine Mammary Carcinomas: A Comparative 
Analysis of Altered Gene Expression, Vet. Sci. 2016, 3(1), 1; doi:10.3390/vetsci3010001 
(registering DOI) (Appendix 8) 

Clemente-Vicario F, Alvarez CE, Rowell JL, Roy S, London CA, Kisseberth WC, Lorch G. Human 
Genetic Relevance and Potent Antitumor Activity of Heat Shock Protein 90 Inhibition in Canine 
Lung Adenocarcinoma Cell Lines. PLoS One. 2015 Nov ll;10(ll):e0142007. doi: 
10.1371/joumal.pone.0142007. eCollection 2015. PubMed PMID: 26560147 (Appendix 7) 

E. What do you plan to do during the next reporting period to accomplish the goals? 

i. We plan to publish some of the work previously submitted and under revision (GIA methods manuscript), 
and we plan to submit other articles, including those mentioned as manuscript #1 and #2 above (Task 
B.ii.Task 3). 

ii. We plan to complete the collection of MWD biological samples and conduct the planned molecular 
genetic studies. 
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4. Impact 

A. What was the impact on the development of the principal discipline(s) of the project? 

Our publication Alvarez, CE, ILAR J 2014 has already been prominently referenced: 

This citation is important because it is in a high impact journal read by the broad field of biomedical 
research: Science Translational Medicine: Kol A, Arzi B, Athanasiou KA, Farmer DL, Nolta JA, 
Rebhun RB, Chen X, Griffiths LG, Verstraete FJ, Murphy CJ, Borjesson DL. Companion animals: 
Translational scientist's new best friends. Sci Transl Med. 2015 Oct 7;7(308):308ps21. doi: 

10.1126/scitranslmed.aaa9116. Review. PubMedPMID: 26446953. 

In “Comparative oncology: what dogs and other species can teach us about humans with cancer”. 
Schiffman JD, Breen M. Philos Trans R Soc Lond B Biol Sci. 2015 Jul 19;370(1673). pii: 20140231. 
This publication cites us (ref. #3) in the Introduction, which starts “Comparative oncology is a 
quickly expanding field ... The study of naturally occurring cancers in the domestic dog provides a 
suitable model for advancement of the understanding, diagnosis and management of cancer in 
humans [1-4].” 

In “The Golden Retriever Lifetime Study: establishing an observational cohort study with 
translational relevance for human health”. Guy MK, Page RL, Jensen WA, Olson PN, Haworth JD, 
Searfoss EE, Brown DE. Philos Trans R Soc Lond B Biol Sci. 2015 Jul 19;370(1673). pii: 20140230. 
These investigators wrote about our paper (ref#9): “Remarkable progress has been made in the last 10 
years, ... to begin to investigate the multifactorial, complex nature of... cancer development in dogs. 
A recent review provides an interesting and salient assessment of this area [9].” 

We also expect that our newly developed statistical methods (mainly GIA) and trait mapping studies 
(submitted manuscripts #1 and #2 mentioned above) will have a strong impact on the field of genetics. 

B. What was the impact on other disciplines? 

Our work has drawn attention from the fields of comparative biology and animal models of human disease. 
While the latter is clear in the area of cancer genetic mapping, there are also important aspects regarding 
epidemiology, human disease relevance and pharmacology. The appreciation of these is demonstrated by the 
PI being invited to contribute a chapter to a book largely dedicated to reproducibility in research using animal 
models to develop and validate new pharmacological therapies, in the Wiley Methods and Principles in 
Medicinal Chemistry series (Fenger et al. 2016 above; Appendix 10). 

C. What was the impact on technology transfer? 

Nothing to report 

D. What was the impact on society beyond science and technology? 

There is a high level of interest in MWDs in the US population at large, especially in military families that 
understand the MWDs are the most effective way to mitigate the major cause of morbidity and mortality in 
deployed military personnel - exposure to explosions. We predict the present project to understand cancer in 
that population will gain media and popular attention. 

5. Changes/problems 

Nothing to report 

6. Products 

A. Publications, conference papers, and presentations 
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Martic-Kehl, M., Festing, M. F. W., Alvarez, C. E., Schubiger, P. A. (2016) Introduction to Animal Models 
for Human Cancer: Discovery and Development of Novel Therapeutics; in Methods and Principles in 
Medicinal Chemistry series, Wiley-VCH, In press. (Appendix 9) 

Fenger J. M., Rowell, J. L., Zapata, I., London C. A., Kisseberth W. C., Alvarez, C. E. (2016) Dog models of 
naturally occurring cancer; in Animal Models for Human Cancer: Discovery and Development of Novel 
Therapeutics; in Methods and Principles in Medicinal Chemistry series, Wiley-VCH, In press. (Appendix 
10 ) 

Lutful Kabir, F.M., Alvarez, C.E. and Bird, R.C., Canine Mammary Carcinomas: A Comparative Analysis of 
Altered Gene Expression, Vet. Sci. 2016, 3(1), 1; doi:10.3390/vetsci3010001 (registering DOI) (Appendix 
8 ) 

Clemente-Vicario F, Alvarez CE, Rowell JL, Roy S, London CA, Kisseberth WC, Lorch G. Human Genetic 
Relevance and Potent Antitumor Activity of Heat Shock Protein 90 Inhibition in Canine Lung 
Adenocarcinoma Cell Lines. PLoS One. 2015 Nov ll;10(ll):e0142007. doi: 
10.1371/joumal.pone.0142007. eCollection 2015. PubMed PMID: 26560147 (Appendix 7) 

Zapata, I., Alvarez, C.E. Genome Mapping of Aggression and Fear Traits in Dogs. The Research Institute at 
Nationwide Children’s Hospital Poster Presentation, April 2015 
Zapata I., Serpell, J.A., Alvarez, C.E., Genetic mapping of canine fear and aggression. The Research Institute 
at Nationwide Children’s Hospital, Annual Retreat (poster presentation) November 2015 (Appendix 4) 
OSU Humanities & Cognitive Sciences High School Summer Institute, lecture on behavioral and 
canine genetics (Aug 13, 2015) (Appendix 11) 

OSU VME 6540 Structure & Function of Cells (Cell Biology), Lecture on canine genetics & genomics 
(Oct 5, 2015) (Appendix 12) 

B. Website(s) or other Internet site(s) 

Nothing to report 

C. Technologies or techniques 

Further developed GIA statistical method for genomewide association of genetic or genomic data; manuscript 
was submitted and is under revision. Developed and implemented an adaptation of identity by descent 
mapping; manuscript was submitted shortly after end of this reporting period (referred to as manuscript #2 
above). 

D. Inventions, patent applications, and/or licenses 

Nothing to report 

E. Other Products 

We developed and validated a new computer script to automate an analysis pipeline that constitutes our GIA 
methodology 

Participants & other collaborating organizations 

A. What individuals have worked on the project? 

Carlos E. Alvarez, PhD, unchanged 

Kun Huang, PhD, unchanged 

Bill Kisseberth, DVM, PhD, unchanged 

Jacob Aaronson (programmer reporting to PI Huang), unchanged 
Isain Zapata, PhD (postdoctoral fellow), unchanged 
Michelle Garcia (formerly Perez), RVT, unchanged 



B. Has there been a change in the active other support of the PD/PI(s) or senior/key personnel since the 
last reporting period? 

Nothing to Report 

C. What other organizations were involved as partners? 

Dr. J. Serpell of University of Pennsylvania is now a collaborator on our new studies that were submitted for 
publication and referred to above as manuscript #1 (Appendix 5). 

8. Special reporting requirements 

This is a Collaborative Award and the report is written to document all three investigators’ roles. All work was 

done by PI Alvarez and his staff except where noted in the header that another PI and their staff were involved 

(Huang, Task 6; and Kisseberth, Tasks 2 and 4). 

9. Appendices: 

1) New Intersection Union Test genome wide association analysis implementations: i) multi-cancer-type GWA 
in one breed & ii) single breed population structure 

2) Continuation of refinement of genomewide analysis: i) epistasis & ii) relationship between size and lifespan 

3) Consideration of new ways of thinking about gene-environment interactions 

4) Analysis of novel gene-environment interactions possibilities: Poster presentations: Zapata et al. Genetic 
mapping of canine fear and aggression 

5) Analysis of novel gene-environment interactions possibilities: Draft manuscript: Zapata, Serpell and 
Alvarez: Genetic mapping of canine fear and aggression 

6) Analysis of novel gene-environment interactions possibilities: Zapata and Alvarez: Fine-mapping and 
biological relevance implicate GNAT3 and IGSF1 in canine fear and aggression 

7) Published article: Human Genetic Relevance and Potent Antitumor Activity of Heat Shock Protein 90 
Inhibition in Canine Lung Adenocarcinoma Cell Lines 

8) Published article: Canine Mammary Carcinomas: A Comparative Analysis of Altered Gene Expression 

9) Accepted manuscript draft: Introduction to Animal Models for Human Cancer: Discovery and Development 
of Novel Therapeutics 

10) Accepted manuscript draft: Dog models of naturally occurring cancer; in Animal Models for Human Cancer 

11) Presentation: OSU Humanities & Cognitive Sciences High School Summer Institute 

12) Academic lecture/curriculum development: OSU VME 6540 Structure & Function of Cells (Cell 
Biology), Lecture on canine genetics & genomics 
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Appendix 1 


New Intersection Union Test genomewide association analysis implementations: 
i) multi-cancer-type GWA in one breed & ii) single breed population structure 


We initiated reanalysis of a new dataset that represents an attractive proxy of the military dog 
data we will ultimately analyze: 

Tonomura N, Elvers I, Thomas R, Megquier K, Turner-Maier J, Howald C, Sarver 
AL, Swofford R, Frantz AM, Ito D, Mauceli E, Arendt M, Noh HJ, Koltookian M, 

Biagi T, Frye S, Williams C, Avery AC, Kim JH, Barber L, Burgess K, Lander ES, 

Karlsson EK, Azuma C, Modiano JF, Breen M, Lindblad-Toh K. Genome-wide 
Association Study Identifies Shared Risk Loci Common to Two Malignancies in 
Golden Retrievers. PLoS Genet. 2015 Feb 2;ll(2):el004922. doi: 

10.1371/journal.pgen. 1004922. eCollection 2015 Feb. PubMed PMID: 25642983 

This data is ideal because, unlike previous available data, it examines more than one type of 
cancer (B-cell lymphoma and hemangiosarcoma) in a single cohort or breed (Golden Retriever; 
approximately 180 affected and 170 controls). This dataset is thus analogous to our Belgian 
Malinois pan-cancer study. Because our new method (GIA, genomewide intersection union test 
association) is well suited to single marker analysis, our initial studies have been to compare 
such analysis to the conventional analysis reported in that study (“collapsing method” whereby 
multiple testing burden is dramatically reduced by using linkage disequilibrium-defined 
haplotype blocks instead of single SNP markers). A second aspect of our reanalysis addresses 
the distinguishing characteristic of GIA - the fact that its statistical mechanism is akin to 
conducting meta-analylsis (of subsets of a single dataset). Our first analysis shown in Appendix 
1 Fig. 1 begins to address both of those aspects using conventional GW AS analysis in PLINK - 
i.e., 1) single marker analysis of the two cancer types separately, and 2) combined analysis 
whereby both cancer types are combined in a cancer vs. normal GW AS. 



-LoglO (P-value) 


Appendix 1. Figs. 1, 2, 3. Mapping 2 cancers in 
one breed 


Golden Retrievers 
BLSA, B cell lymphoma 
HAS, hemangiosarcoma 

Arrows denote upstream or downstream of a peak SNP 

Analysis of genomewide association using conventional methods (in PUNK) 

Dataset from Tonomura et al. PMID: 25642983 
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Appendix 2 


Continuation of refinement of genomewide analysis: 
i) epistasis & ii) relationship between size and lifespan 


In this reporting period, we continued refining and validating our novel methods to conduct 
genetic (genomewide association analysis or GW AS) and multidimensional analysis: 
Genomewide Intersection-Union-Testing (IUT) Analysis (GIA). The latest efforts in this area are 
described with figures in Appendix 1. In this reporting period, we initiated studies of new 
datasets that provide new opportunity to test and validate our methods. Those include human 
genetic data from HapMap (http://hapmap.ncbi.nlm.nih.gov/index.html.en). This is desirable 
because the extent of linkage disequilibrium (LD) is reduced in the range of 50-fold compared to 
humans; that allows us to test our approach on the other end of the spectrum from pure breed 
data with exaggerated LD. At the same time, that data has the benefit of being dramatically 
stratified like dog breeds, because it is made up of genotypes from select human populations 
throughout the world - some closely related, others distantly related. Unlike most human genetic 
data, this dataset is not highly regulated because of the anonymous design of the project. In 
addition, we requested and received the dataset from the largest dog genetic study published to 
date (PLoS Biol. 2010 Aug 10;8(8):el000451.): 915 dogs from 80 breeds. This allows for a 
different set of partially overlapping dogs as reported by Vaysse et al. (LUPA consortium, cited 
above), and including several of the same traits. The ongoing analyses are encouraging (see 
Appendix 2, Figures and text). 

In addition to developing and optimizing the statistical methods (Appendices 1, 2), we 
have initiated the design of improvements of the computer script to automate the GIA analysis. 
The script has been validated and implemented by repeating previous analyses. Going forward, 
we plan to optimize it further and add functionalities that will further accelerate our GIA 
analyses - including creation of an interface that allows selection of variable parameters. For 
example the current version of the automated script can only analyze case-control data; we thus 
plan to add the option of analyzing quantitative trait loci (QTL). 

In this reporting period, we initiated discussions of analysis options for gene-environment 
interactions. In addition, we recognize that a similar effect can pertain to non-cancer clinical 
traits or disease. This can be important because there can be both positive or negative 
correlations between cancer and such traits/diseases. And some of those relationships can be 
predicted to have effects on cancer rate. Examples include age- or metabolism- related traits that 
indicate advanced aging, or obesity or larger body size of some dogs vs. others (Appendix 2, 

Figs. 3, 4). 



Epistasis analysis 

Figure 1 (right). Epistatic interactions between individual SNPs 
within the relevant lifespan-associated QTLs identified through 
intersect union test GWAS. This plot shows the number 
significantly correlated SNPs within the relevant locus. The more 
interactions there are (the thicker the line) the more correlation 
exists among the locus. Higher correlation may be suggestive of 
a interaction of effects. Within hits, the one in chromosome 10 
appears to be the largest one. Data source: Vaysse et al. 2011. 



Figure 2 (bottom). Epistatic interactions between the individual SNPs within the relevant lifespan 
associated QTLs identified though intersect union test GWAS against a list of genes involved in telomere 
regulation. Telomere length has been shown to be highly predictive of life expectancy. This plot 
evaluated the epistatic interactions between the lifespan associated QTLs and telomere regulation 
genes. The thicker the line, the larger the correlation that exists. The most relevant epistatic association 
detected was the hit in chromosome 10, which has several associations that are evident but especially 
to EXOl. 
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Interbreed Size vs Lifespan association. 


Two main studies have been published in the past where the interbreed size variation was evaluated in 
a GWAS setting. Those studies are Boyko et al. 2010 and Vaysse et al. 2011.Both studies have similar 
conclusions in regards to size but none of them has been evaluated for lifespan QTL associations. We 
finally gained access to both datasets so they can be compared. Both datasets were evaluated originally 
using different methodologies that are not easy to compare. We began by analyzing the traits of Size 
and Lifespan using our intersect union test GWAS. All QTLs are evaluated on stereotypical phenotypes o 
each breed. 


Figure 3 (right). Empirical linear 
regression curves were constructed 
to evaluate the distribution of the 
selected breeds to be compared. 
Each study used a different set of 
breeds. Boyko et al. 2010 included 
80 breeds represented by 
approximately 10 individuals per 
breed. While Vaysse et al. 2011 
included 30 breeds represented by 
approximately 16 individuals per 
breed. The datasets only contained 
19 breeds that were common in 
both studies. The empirical linear 
regression equation and R-squared 
value are very close to each other 
even when the breed population is 
different. This evaluation suggests 
the presence of some outliers that 
may be negatively impacting the 
Q.TL detection efficiency, (i.e. the 
English Bulldog has a very short 
lifespan for what was expected for 
his weight). It will be further 
evaluated if the removal of outlier 



Lifespan vs Size (Vaysse et al. 2011) 
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breeds improves the efficiency of the detection. 



Figure 4 (Bottom). Manhattan plots of Size and Lifespan QTLs using the 4 group intersect union test 
GWAS. In the case of Size, the main hit in chromosome 15 and the X chromosome become quite evident 
in both datasets; however, the Vaysse dataset detected some addional signal coming from 
chromosomes 4, 7 and 10 that are not that evident the Boyko dataset. Boyko dataset included several 
lager breeds that are not represented in the Vayse dataset. That feature may be hiding some of the 
other relevant hits. In terms of Lifespan, hits are not that evidently clear as compared to size, there is no 
main hit that is common in both datasets besides chromosome 5 and 10. In this case it may also be 
important to evaluate if the breed population representation can be altering the findings. 
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Appendix 3 


Consideration of new ways of thinking about gene-environment interactions 


In this reporting period, we have devoted significant effort to considering a new way of thinking 
about gene-gene and gene-environment questions in our present study. A recent high-impact 
research article proposed a new way of thinking about cancer risk; this has generated a great deal 
of discussion: 

Tomasetti C, Vogelstein B. Cancer risk: role of environment—response. Science. 2015 Feb 
13;347(6223):729-31. doi: 10.1126/science.aaa6592. Epub 2015 Feb 5. PubMed PMID: 
25678653. 

“If hereditary and environmental factors cannot fully explain the differences in 
organ-specific cancer risk, how else can these differences be explained? Here, we 
consider a third factor: the stochastic effects associated with the lifetime number of 
stem cell divisions within each tissue. In cancer epidemiology, the term 
“environmental” is generally used to denote anything not hereditary, and the 
stochastic processes involved in the development and homeostasis of tissues are 
grouped with external environmental influences in an uninformative way. We show 
here that the stochastic effects ofDNA replication can be numerically estimated and 
distinguished from external environmental factors. Moreover, we show that these 
stochastic influences are in fact the major contributors to cancer overall, often more 
important than either hereditary or external environmental factors. 

That cancer is largely the result of acquired genetic and epigenetic changes is 
based on the somatic mutation theory of cancer (9-13) and has been solidified by 
genome-wide analyses (14-16). The idea that the number of cells in a tissue and their 
cumulative number of divisions may be related to cancer risk, making them more 
vulnerable to carcinogenic factors, has been proposed but is controversial (17— 

19)...” 

Given the focus of our study on identifying genetic and environmental contributions to cancer 
risk in the military dog population, that way of thinking about cancer risk presents some strong 
candidates for gene-environment interactions that could be relevant. Those include dog size (or 
growth rate, duration of growth period, or other metabolic traits), aging rate (as cancer is 
associated with advanced age) and aggression (because the military dogs are bred for aggression, 
aggression is associated with stress which is associated with cancer, and it represents a possible 
gene/amimal-environment (incl humans and other dogs) interaction). In this reporting period, we 
conducted analyses on cross-breed association of aggression and genetic markers (using 



published datasets of breed genotypes (Boyko et al. PMID: 20711490; Vaysse et al. PMID: 
22022279) and phenotypes (c-BARQ resource; McGreevy et al. PMID: 24358107)). Strikingly, 
we were able to map genomic loci associated with aggression across diverse dog breeds 
(Appendix I). Because the analysis is across breeds, the mapping is not subject to the extreme 
linkage disequilibrium distances within dog breeds - and thus serves as fine mapping to strongly 
implicate single genes. 

To refine the aggression trait itself, we first conducted Principal Components Analysis 
(PCA) of broad behavioral traits (as breed stereotypes). This revealed a cluster of three related 
traits - dog-directed aggression, human stranger-directed aggression and fear of human stranger. 
As a result, we predicted that genetic mapping of those traits separately could yield loci shared 
by those traits. Strikingly, that is what we found (Appendices 4-6). Moreover, the genes 
implicated - GNAT3 and IGSF1 - are exclusively dedicated to sensory perception (we 
hypothesize in pheromone sensation in the vomeronasal organ or olfactory epithelium) and the 
hypothalamic-pituitary axis that drives the stress response. We were able to show this effect in 
two genotype datasets with different breeds, and to establish that it is associated with the three 
aggression/fear traits mentioned above (but not universally across other behavioral traits). These 
findings are very encouraging because they suggest 1) aggression can be mapped, and 2) this 
particular cluster of aggression/fear traits are explicitly associated with gene-environment 
interactions (environment being non-familiar humans/animals). Thus these traits appear to be 
highly relevant to our study because the dogs are bred and trained for aggression. 



Appendix 4 


Analysis of novel gene-environment interactions possibilities 
Poster presentations: Zapata et al. 


Explained in Appendix 3 text. 

Poster presentations of this work: 

Zapata, I., Alvarez, C.E. Genome Mapping of Aggression and Fear Traits in Dogs. The Research 
Institute at Nationwide Children's Hospital, April 2015 

Zapata I., Serpell, J.A., Alvarez, C.E., Genetic mapping of canine fear and aggression. The 
Research Institute at Nationwide Children's Hospital, Annual Retreat, November 2015 
(poster media shown on next page) 
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Genome Mapping of Aggression and Fear Traits in Dogs 

Isain Zapata & Carlos E. Alvarez 

Center for Molecular and Human Genetics, The Research Institute at Nationwide Children’s Hospital, Columbus, OH 


Introduction 


Results 


Findings 


Genomewide genetic associations (GWAS) of behavioral traits is difficult in 
humans. This is due to heterogeneity, biological complexity, and the 
challenge of phenotyping large numbers of individuals. Thus, little is known 
about the genetics and molecular biology of behavior. 

The genetic history of dogs has resulted in hundreds of isolated 
populations with diverse morphological and behavioral traits. Those various 
breed stereotypes can be relatively easily ascertained through i) direct 
phenotyping of many individuals from diverse breeds, or ii) questionnaire- 
based surveys of very large numbers of diverse pedigree dogs. 

We are identifying common gene variations that are associated with 
different behaviors (such as aggression and fear) across dog breeds. We 
propose that knowledge of their genetic basis will enable us to dissect many 
of these behaviors and their correlations. That new understanding will lead to 
novel strategies for improving human mental health. 

Objective 


Identify relevant genome regions associated with 
behavioral traits of aggression and fear in the dog. 


Methods 


Phenotypes 

■ C-BARQ values. Behavioral means by breed of the 30 most popular 
breeds by AKC. (Serpell et al. 2014) 

■ Owner reported behavioral data of AKC registered dogs. 

■ -175 subjects per breed. (Min: 49-Bulldog Max: 1120-Labrador Retriever) 

■ Aggression decomposed into 4 classes 

■ Fear decomposed into 5 classes 

Datasets 

■ Boyko dataset: 327 subjects from 29 Breeds, -49,000 SNP Afifymetrix v.2 
Canine array (Boyko et al. 2010). 

■ Vaysse dataset: 150 subjects from 11 breeds, -178,000 SNP Illumina 
Canine HD array (Vaysse et al. 2011). 

Analysis 

■ All descriptive statistics performed on SAS v.9.3. 

■ GWAS performed using GEMMA v.0.94.1 (Zhou et al. 2012). 

■ Population structure removed by centered relatedness matrix correction. 

■ Association test performed using Univariate Linear Mixed Model using 
the likelihood ratio test. 

> Genomewide significance P-value cutoff at <lE-5 for the Boyko dataset 
& <lE-8 for the Vaysse dataset. 
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Conclusions 


• Aggression and fear-predisposition are correlated - both 
by PCA and GWAS 

• We identified four distinct regions highly associated with 
aggression and fear; and have a solid gene candidate for 
three of those 
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Appendix 5 


Analysis of novel gene-environment interactions possibilities: 
Draft manuscript: Zapata, Serpell and Alvarez 

See discussion of rationale for relevance to cancer risk and this project in Appendix 3. 
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One Sentence Summary: Genome scanning across dog breeds reveals that different 
genome loci are associated with two clusters of fear and aggression traits. 


Abstract: 

Little is known about the genetics of fear and aggression in humans and other mammals. 
We conducted genomewide association (GWA) mapping of breed stereotypes for many 
fear and aggression traits across several hundred dogs from diverse breeds. Those 
findings were confirmed using a second cohort of partially overlapping breeds. Lastly, we 
used the validated loci to create a model that effectively predicted fear and aggression 
stereotypes in a third group of dogs from breeds that were not involved in the mapping 
studies. Our major findings are that i) owner directed aggression and dog rivalry are 
associated with variation for small body size, and ii) two loci, on chromosomes 18 and X, 
are associated with several traits, including fear and aggression that is directed toward 
dogs and human strangers. We discuss the biological relevance of genes at the GWA 
peaks and the possibility that those variants were involved in the domestication process. 


Main Text: 

Introduction 

It is difficult to perform genomewide genetic association studies (GWAS) of human 
behavior. This is due to heterogeneity, biological complexity, ambiguous phenotype 
classifications and the challenges of phenotyping large numbers of individuals. As a 
result, very little is known about human behavioral genetics. Most of the progress has 
been driven by the availability of epidemiological data of medical relevance: smoking 
behavior (i), coffee consumption behavior (2), alcohol drinking behavior (3), and mental 
disability (4) or illness (5). In contrast, there has been limited exploration of common 
complex-behaviors such as aggression (6), happiness (7) or social phobia (8). 



Largely due to heterogeneity, most complex traits are difficult to map in humans. 
Dozens to thousands of variants can each contribute minute amounts to heritable risk, and 
this can differ dramatically in different ethnicities and their subgroups. Many major 
breakthroughs in human genetics have resulted from studying isolated populations and 
multigenerational families. The advantages of those latter approaches, and others, are 
dramatically exaggerated in dogs: (i) There are approximately 400 dog breeds, each on 
the order of 100-fold less genetically-complex than the full population. Thus, compared 
to humans and their major ethnic groups, dogs are much more similar within breeds and 
much more different across breeds, (ii) Dogs are often part of a family or working 
environment and receive high levels of health care. Lastly, iii) dogs have more 
phenotypic variation than any other land mammal; and much of that variation is the result 
of “evolutionary” selection under domestication. The strengths of dog models of complex 
genetics have been exploited mainly in the area of cancer (9), but recently also in 
behavior. Examples include investigation of obsessive compulsive disorders in select 
breeds (10, 11) and diverse behavioral traits in one breed (nerve stability, affability, 
wariness, adaptability, sharpness, activity and reactions during blood draws (12)). 

Strikingly, multiple groups recently showed that most genetic variation associated 
with diverse morphological traits in dogs can be mapped by cross-breed GW A using only 
breed stereotypes and about a dozen or more individuals each from dozens of diverse 
breeds (13-15). This revealed that a great extent of the genetic variation in domesticated 
dogs was present prior to breed creation. A good example is the trait of body size, the 
majority of which is explained by six gene variants in all but very large breeds (see the 
following and refs within: (16)). Those canine genes are relevant to known size-biology 
across phyla (e.g., IGF1, IGF1 receptor, growth hormone receptor, HMGA2 and SMAD2) 
and others indicate opportunities for discovery (e.g., STC2). A landmark GW AS from 
2008 reported many loci and five candidate genes associated with the following 
behavioral traits: herding, boldness, excitability, pointing, and trainability (13). A more 
recent study that used the same stereotype data, and a cohort with different breeds (15), 
did not replicate those findings for one overlapping trait, boldness (which was associated 
with loci on five chromosomes in the first study and a single other chromosome in the 
second). The chrlO region that was associated with boldness in the newer study, between 
the MSRB3 and HMGA2 genes, was the same as was strongly associated with two 
morphological traits - reduced ear erectness and small size. Although each of the three 
traits appeared to be associated with a different haplotype, with one exception, all bold 
breeds were erect-eared and small, and vice versa for non-bold breeds. This region spans 
among the most highly-differentiated markers reported from single-marker Fst analysis 
and, at 2Mb, it is the second largest of such regions. Similarly, Vaysse et al. showed that 
sociability (attitude toward unknown humans) maps to the highest Fst region in the 
genome (2.6Mb, chrX), which was shown by others to be associated with skull shape and 
large size (14). To our knowledge, there are no further claims to resolve the various 
genetic associations or to suggest biological relevance of those loci to boldness and 
sociability; they appear to be open questions. 

Here we report mapping fear and aggression traits associated with genetic 
variation shared across diverse breeds. These represent very common and important 



canine traits in the behavioral veterinary setting (77), and in human public health (18). It 
seems likely to us that our findings will also prove to be relevant to human anxiety 
disorders and aggression, violence and criminality. Additionally, dog is the only animal 
that was originally domesticated by humans for almost-purely behavioral traits - and 
arguably is the only predator to be fully domesticated. Fear, aggression and related traits 
like tameness have long been thought to be central to the domestication of dogs (79), and 
this is supported by experimental domestication of silver foxes (20). Both wild wolves 
and foxes are typically more fearful and aggressive than their domesticated counterparts; 
however, some dog breeds have been actively selected for enhanced aggressiveness in 
certain contexts such as fighting, guarding or vermin control. Our findings show that 
canine fear and aggression that are directed toward strange humans or other dogs share 
variation that was present prior to the creation of dog breeds. In another work, we report 
fine mapping analysis that implicates specific single genes at two loci associated with 
canine fear and aggression (27). Both genes are highly tissue-specific - with highest 
expression levels in the amygdala and pituitary/hypothalamus, respectively - and suggest 
strong biological relevance for these traits. One variant is protective and the other 
increases risk of fear and aggression. We discuss below how variation at these loci may 
have been selected-for during the process of domestication. 

Results 
Study design 

The present study was designed to test whether breed stereotypes of fear and aggression 
could be mapped by cross-breed GW A. While this concept has been validated for 
morphological traits, it has not been for behavioral traits. Success here requires two 
primary elements: biologically-relevant and robust phenotype data (seemingly likely 
from studies cited below), and the sharing of behaviorally-associated genetic variation 
across diverse breeds (which is unknown). We used three unrelated breed-specific 
resources: one of behavioral phenotypes (22) and two of breed-specific genotypes (14, 
15). The phenotype dataset is derived from C-BARQ owner questionnaires (23). In C- 
BARQ, fear and aggression comprise five and four subtypes, respectively. All but two of 
these C-BARQ phenotypes (‘dog rivalry’ and ‘touch sensitivity’) were previously 
validated using a panel of 200 dogs with prior diagnoses of specific behavior problems 
(23). More recently, other studies have also provided criterion validation by 
demonstrating associations between these phenotypes and particular training outcomes in 
working dogs (24), and the performance of dogs in various standardized behavioral tests 
(25-27). There is currently no alternative phenotype resource that approximates the 
numbers of breeds and traits represented. The two genotype datasets used here are those 
used to map cross-breed stereotypes and study population genetics by Vaysse and 
Ratnakumar et al. (a large European collaboration including the LUPA Consortium, led 
by Webster; 30 breeds, 175,000 SNP markers) and Boyko et al. (a large USA 
collaboration led by Bustamante and Ostrander; 45,000 SNPs). The behavioral and 
genotype datasets overlap for 11 breeds and 29 breeds for Vaysse’s and Boyko’s datasets 
respectively. 



Our study design included the following: i) principal components analysis (PCA) 
of breed phenotypes and genotypes (C-BARQ data and each genotype dataset); ii) 
discovery study of GWA mapping of published C-BARQ behavioral data (i.e., that 
corresponding to the Vaysse subset of the 30 most popular American Kennel Club 
breeds) with the Vaysse genotypes; iii) confirmation study of the Discovery results using 
C-BARQ and Boyko datasets; and iv) testing of the internal-consistency of the C-BARQ 
phenotypes and of the prediction-model performance in a second set of breeds for which 
phenotype data was not previously published. In separate work, we report fine mapping 
the two peak regions associated with canine fear and aggression directed to other dogs 
and human strangers (21). 

Discovery studies of genetic association with fear and aggression 

Assuming sufficient power, the greatest potential limitations of the present GWAS’s are 
latent variables, such as cryptic relatedness and batch effects, and population structure. 
We mitigated these in several ways, beginning by using many breeds instead of few. We 
used two genotype datasets that represented different genotyping platforms and cohorts 
with partially overlapping breed contents as discovery and confirmation datasets. Thus, 
each dataset has different batch effects, population structure and cryptic relatedness. 
Lastly, we controlled for relatedness and population structure in the GWAS’s by using a 
centered relatedness matrix correction (i.e., the Genome-wide Efficient Mixed Model 
Association algorithm or GEMMA (28)). 

Before initiating mapping studies, we conducted Principal Components Analysis 
(PCA) of the breed-specific C-BARQ data on fear and aggression together with the 
genotype data from each of the genotype datasets separately (Fig. 1). Component scores 
of the breeds evaluated (Figs. 1B/D) showed no relevant deviations across the two 
genetic datasets. Breeds evaluated in both datasets are distributed similarly in both plots. 
This preliminary evaluation suggests that, despite having different breed makeups in the 
two datasets, they show consistent results. Using either genotype dataset the following 
traits are clustered together apart from the others: stranger-oriented fear, stranger-directed 
aggression and dog-directed aggression. This suggested to us that the three traits could be 
genetically-related. A second implication of the PCA results is that owner-directed 
aggression is most distant, and therefore different, from the three clustered traits 
mentioned above. In other words, the PCA indicated a testable hypothesis that the three 
clustered traits would share associated loci, but that owner-directed aggression was 
associated with other loci (we go on to confirm this). 

Our initial discovery study tested for genomewide-significant association for fear 
and aggression traits from C-BARQ using the genotype dataset of Vaysse et al. (15) 

(Figs. 2, 3). This GWAS involved 175,000 SNP markers genotyped in 150 individuals 
from 11 breeds. Consistent with the PCA results, stranger-oriented fear, and stranger and 
dog-directed aggression is predominantly associated with many markers at two loci - 
chrl8:23,260,370 and chrX: 105,245,495-105,877,339 (CanFam2 assembly) - whereas 
owner-directed aggression is distinct. Rather, the latter is associated with previously 
described small-size variation in the IGF1 gene on chrl5:44Mb and with a single marker 
at chr34:29Mb (Fig. 2C). Dog-oriented fear, which does not purely cluster with stranger- 



directed fear/aggression and dog-directed aggression (but is not distant), is also 
predominantly-associated with the chrl8 and X loci. Touch-sensitivity also has both of 
those loci, but a marker on chrlO:l 1Mb (affecting HMGA2; in this dataset, it is the 
second strongest cross-breed variant associated with small size after IGF1 (75)) has the 
second strongest signal and chrX: 105Mb is considerably weaker. These latter two 
findings are consistent with the PCA pattern in Figure 1C, where touch-sensitivity and 
dog-oriented fear are closest to stranger-oriented fear and stranger- and dog-directed 
aggression. The other fear traits share one, but not both, of the chrl8 and X loci: 
nonsocial fear is associated with chrl8 and the same chrlO:llMb marker associated with 
touch-sensitivity; and separation-related anxiety is associated with chrX: 105Mb and 
chrlO:llMb (chr 18:23Mb is suggestive). 

Confirmation of genetic association results in a second cohort 

We repeated the discovery studies using the same behavioral data, but with the Boyko 
genotype dataset (14) (Figs. 2, 3; Table 1). The data we used was genotypes of 45,000 
markers in 327 individuals from 29 breeds, 11 of which overlapped the Vaysse data used 
above. Because the same breeds were not used in the two studies, this is not strictly a 
replication study. However, because we are looking at cross-breed association, the 
findings in one can confirm those in the other. We also expect the paired GWAS’s to 
mitigate false positive hits that are due to latent variables or population structure in the 
individual studies. 

The results of the confirmation GWAS’s generally confirmed associations of 
chr 18 and X with stranger and dog-oriented fear and aggression. Both loci are 
genomewide-significant in dog-oriented fear, but only chr 18 is significant in stranger- 
oriented fear and aggression (chrX was suggestive in both). Dog-directed aggression has 
no significant hits, but has suggestive evidence for chr 18 among the top ten markers. As 
in the Vaysse GWA above, owner-directed aggression is most strongly associated with 
several markers that peak within the IGF1 gene on chrl5:44Mb. Whereas the discovery 
GW AS of dog rivalry has no hits, the confirmation GW AS shows strong association with 
IGF1; and, unlike in the discovery GW AS, this locus is also significant in dog-oriented 
fear, separation-related anxiety, and touch-sensitivity. We interpret this as predominantly 
due to the breed make-up of the two cohorts. At the level of genomewide significance, 
the most similar results between the two studies are for dog-oriented fear (chr 18 and X), 
separation-related anxiety (chr 10 and chrX) and touch sensitivity (chr 10 and chr 18, with 
chrX being significant in discovery and almost significant in confirmation). 

Prediction model and internal consistency of C-BARQ behavioral data 

We next tested the C-BARQ instrument for internal consistency of the behavioral data. 
Specifically, we wanted to know if the behavioral data is robust as well as qualitatively 
and quantitatively consistent across breeds. Alternative possibilities include low signal to 
noise ratios, biases due to human breed-ownership behaviors, and biases related to dog 
status or behavior that affect participation in C-BARQ study. This analysis does not 
require one to assume that a trait classified as fear or aggression is necessarily so (or that 



those can be sub classified in biologically meaningful ways); it is only to test whether the 
same classification in one group of breeds is the same as that in another group. 

Our most robust discovery and confirmation candidates presented above are those 
four loci significantly-associated with C-BARQ fear and aggression traits in two 
independent datasets (from different genotyping platforms applied to different breed 
cohorts): chrlO, chrl5, chrl8 and X. We used those markers to design a phenotype 
prediction model that could be applied to the breeds for which C-BARQ behavioral data 
exists, but has not been published previously. The model was created using our results for 
the published C-BARQ data for the subset of the top 30 American Kennel Club breeds 
(22) for which we had genotype data. It was implemented on all other breeds for which 
we have genotypes within Vaysse’s dataset; this was done without prior knowledge of the 
C-BARQ breed stereotypes that were being predicted. We thus predicted fear and 
aggression for 18 breeds with existing, but previously unpublished, C-BARQ data. We 
used a model design of multiple linear regressions with a stepwise forward selection 
method (Fig. SI). The discovery breeds’ allele frequencies at each of the four loci were 
used as primary data. We allowed the selection method to keep only significant predictors 
by trait. Then we used the beta coefficients estimated for the predictors remaining in the 
model, along with the allele frequencies of the prediction phase breeds, to estimate their 
breed-predicted value for that trait. 

Across all prediction models (Fig. SI), the hit on chrlO was never significant and 
was thus never included. Chrl5 was removed by the method for all traits except for 
owner-directed aggression and dog rivalry, for which it was the only significant predictor. 
This evidence suggests small dog size is highly correlated with these types of aggression 
as previously suggested (29), but it is not a strong predictor of other types of fear and 
aggression. Chrl8 hits remained significant in all other traits, with the exception of 
owner-directed aggression and dog rivalry. In all models, both chrl8 hits 
(chrl 8:23260370 and chr18:23298242) were included at the beginning of the selection 
process but only one or the other remained. These two markers are partially correlated 
and we assume the detection of one or the other is due to the ancestral haplotype 
diverging across breeds. The chrl 8 hits were the only significant predictors for dog- 
oriented fear, dog-directed aggression, nonsocial fear and touch sensitivity. The chrX hits 
were significant predictors for stranger-oriented fear, stranger-directed aggression and 
separation-related anxiety. As with chrl 8, all three chrX hits detected across all analyses 
(chrX: 105245495, chrX: 105770058 and chrX: 105877339) were included at the 
beginning of the selection process, but only one remained for each trait. These markers 
are very highly correlated and therefore their inclusion in the model can be 
interchangeable. R-square values for all predictions range from 0.54 to 0.9. We observed 
that beta coefficients for the chrl 8 hits are always positive and have values double in size 
to the beta coefficients for the chrX hit, which is always negative (Fig. SI). Our 
prediction models suggest that chrl 8 and X have an additive effect on fear and aggression 
variability. More precisely, our data suggests that the most common chr 18 haplotype 
across dog breeds is protective from some types of fear/aggression (Fig. 4). In contrast, 
one of the two common haplotypes on chrX (Fig. 4) increases risk. 



To evaluate the performance of our prediction models, we calculated predicted 
values for all breeds not used in the discovery and confirmation phases and compared 
them to the values obtained from the raw unreleased C-BARQ data (by calculating the 
difference between them). To determine whether the performance of our prediction is 
superior to random chance, we first estimated the chance of randomly making a 
successful prediction within a 95% confidence interval for all of the observed values. 
Since the sample sizes for each breed is different, the random chance of making a 
successful prediction within a 95% confidence is also different. The median chance for 
making a successful prediction by chance across all breeds and traits is 5.82%, ranging 
from 1.1% to 30.28%. Next, we calculated the cumulative probability of all random 
chances. We estimated that 11 out of the 162 values predicted would be expected by 
chance. Lastly, we evaluated if the difference between the observed and predicted values 
is inside or outside of the confidence interval, and declared a success for every case 
where our predicted value was inside. Our total number of successful predictions was 95 
out of 164 or 58% which is significantly better than random chance (p<0.0001). We 
propose that this validates the internal consistency of C-BARQ behavioral data across 
breeds. A success and failure matrix that summarizes our predictions is displayed in 
Figure 5. Our predictions are not skewed in a single direction across traits and breeds. 
However, our prediction efficiency is better for some traits and breeds than others. That is 
consistent with the possibilities that other variants are involved, and that those are not 
uniformly associated with those traits across many or most breeds. 

Haplotype analysis and candidate genes 

The associated loci and alleles on chrl5 ( IGF1 ), chrlO (. HMGA2 ) and chr3 ( IGF1R ) are 
the same as those known to be associated with small size across dog breeds. The 
corroborated hits on chrl8 at 23,260,370 and 23,298,242 map between GNAT3 and CD36 
(Fig. 6A). The region was reported to be associated with allometric dog height and leg 
length (14), and correspondingly demonstrated to harbor a FGF4 -retrogene insertion that 
causes chondrodysplasia in several dog breeds (30). In the Vaysse dataset, the fear- 
increasing chrl8 haplotype is only present in Dachshunds and Yorkshire Terriers. Both 
those breeds carry the chondrodysplasia retrogene insertion. However, while the short¬ 
legged Dachshund is fixed for the insertion and always exhibits the trait, the insertion is a 
common minor allele in the Yorkshire Terrier but it never results in the trait (speculated 
to be due to its small size (30)). The following breeds that have >20% allele frequencies 
of the increased-fear haplotype, but lack common chondrodysplasia: Shar-Pei, Beagle, 
Cocker Spaniel and Husky (both the Beagle and Cocker Spaniel are known to lack the 
retrogene-insertion allele (30)). These observations suggest that the retrogene insertion 
occurred in the ancestral increased-fear allele, and that the two traits are only linked in 
that resulting lineage. The corroborated hit on chrX: 105,877,339 is within the 2.6Mb LD 
block with the strongest Fst signal in the dog genome (15). Others have mapped large 
size and skull shape to this locus, but no clear gene implications have been reported (14). 
The peak fear region has one non-coding RNA and two protein coding genes, with IGSF1 
being the closest (Fig. 6B). In most breeds that carry the reduced-fear allele (9/11 in the 
Vaysse dataset), that allele is on the same haplotype as the increased-size allele. While 



the linkage is not nearly as strong between the alleles for increased fear and sociability, 
they are also correlated in several breeds. These findings at the four loci are consistent 
with the PCA analysis that shows at least two different fear and aggression phenomena - 
one of which is associated with dog size. 

Analysis of the peak region on chrl8 reveals a very low level of LD (Fig. 6A), 
indicating that the associated haplotype is very old. In contrast to chrl8, the associated 
region on chrX lies within a 2.6-3.7Mb block of strong LD (Fig. 6B; (14, 15)). We find 
the same increased-fear alleles at the two loci are present at frequencies of 43 and 50%, 
respectively, in extant wolf populations (31, 32). Analysis of relevant breeds using whole 
genome sequence data (31, 32) and DNA copy number variation (CNV) data (33-35) 
appears to rule out that the functional variants are protein coding changes or CNVs. 

Based only on that and the positional results, we would propose the functional variation 
affected expression of GNAT3 or CD36 on chrl8 and 1GSF1 (or the flanking genes 
ARHGAP36 and FIRRE) on chrX. However, the locus on chrX requires detailed breed- 
specific haplotype analysis to determine if fine mapping is possible within that 2.6Mb LD 
block. In the Discussion we mention our separate work that analyzes the evolutionary 
patterns of variation at the two loci and considers the biological relevance of the two 
genes implicated by that (GNAT3 and IGSF1). 

Discussion 

Understanding fear and aggression in dogs is important for canine wellbeing, human 
public health and to understand the process of dog domestication. It also has great 
potential to lead to medical translation to related mental disorders in humans. Here we 
mapped behavioral traits by using breed stereotypes to conduct a series of interbreed 
genome scans. This approach is well developed for morphological traits, but has not been 
validated for behaviors. In this work, we validate our findings by conducting separate 
GWAS’s using cohorts comprised of partially overlapping breeds (Table 1). We then 
provide additional validation by developing a predictive model and applying it 
successfully to a different group of breeds. This also serves to demonstrate internal 
consistency of C-BARQ phenotyping across breeds. 

Our PCA of breed genotypes and phenotypes indicates that some types of fear and 
aggression are related to each other (stranger-oriented fear and dog-/stranger-directed 
aggression), but are distinct from others such as owner-directed aggression. This pattern 
was mirrored by the results of our behavioral GWAS’s which identified two genome loci 
associated with the former (also shared with dog-oriented fear) and another two 
associated with the latter (also shared with dog rivalry). Notably, owner-directed 
aggression and dog rivalry are associated with the same variation in IGF1 that is known 
to have the greatest contribution to small-size across dog breeds (the former trait is also 
associated with the small-size variant at the IGF1 receptor gene). This finding is 
consistent with previous reports that i) there is a highly-significant correlation between 
the behaviors of owner-directed aggression and dog rivalry, ii) this correlation is 
independent of dog- and stranger-directed aggression, and iii) these behaviors are 
associated with breeds of small to medium size (29, 36, 37). And owner personality does 
not necessarily predispose to owner-directed aggression, which is thus an apparent dog 



trait (38). Some of those studies also showed a correlation between small size and 
stranger-oriented fear and aggression, dog-oriented fear, separation anxiety, and touch 
sensitivity (29). That is supported by our finding in the confirmation GWAS’s that the 
same small-dog IGF1 allele is associated with the latter three traits. It is unclear whether 
the behavioral associations with small-size gene variants are due to developmental, 
physiological or psychological effects; all seem probable. 

The loci on chrl8 and X are particularly interesting because they are novel and 
associated strongly with the related traits of fear and aggression directed to dogs and 
human strangers. Variants in those regions are also associated with morphological traits 
such as leg length and body size, respectively. The closest genes to the chrl8 and X 
association peaks are GNAT3 (Gustducin alpha, the G protein alpha subunit for bitter, 
sweet and umami taste cells (39)) and CD36, and IGSF1 (which results in a congenital 
syndrome affecting Thyroid and Growth Hormones when mutant in humans (40)), 
respectively. The fear/aggression peak near IGSF1 lies within a 2.6-3.7Mb region of 
strong LD that includes flanking peaks for size and sociability (14,15). In a separate 
report, we demonstrate how evolutionary footprints can be used to fine map functional 
variation across dog breeds (21). There, we show that the reduced-fear/aggression allele 
is often in perfect LD with the increased-size allele, but much less so with regards to the 
increased-sociability peak allele. We also provide evidence that the fear/aggression 
functional variants on chrl8 and X are most likely to affect expression of GNAT3 and 
IGSF1, respectively. [Notably, IGSF1 is co-expressed with its two flanking genes, 
ARHGAP36 and the long non-coding RNA F1RRE (21, 41).] Both of those neuronal 
genes have strong biological relevance at the level of neuroanatomy (see 21 and refs, 
within). Outside taste receptor and other chemosensory cells (and a subset of 
vomeronasal interneurons (42)), GNAT3 is most highly expressed in the amygdala (21). 
IGSF1 is predominantly and very abundantly expressed in two brain regions - the 
pituitary and parts of the hypothalamus (21, 43). Further studies are necessary to 
determine if the size and fear/aggression traits on chrX are due to the same, distinct or 
overlapping variation. It is interesting that at least three genes in the region are co¬ 
expressed in tissues involved in determination of body size (the pituitary gland and 
hypothalamus), and that mutations in IGSF1 affect human size (40). Rat studies have 
revealed that mRNA expression of both genes are regulated in fear-relevant models: 
GNAT3 in amygdala under pain stimuli (44); and 1GSF1 in cortex after stress or tactile 
stimuli (45). Thus both genes are strong candidates for fear relevance: they are neuronal 
and associated with stress/anxiety according to interbreed dog GWA, neuroanatomy and 
biology. 

Because our model of the chrl8 and X variation was successful in predicting the 
relevant fear and aggression behaviors in a third group of non-overlapping breeds, we 
believe these markers can be used to predict and, in part, explain such behavior across 
many dog breeds. However, the behavioral stereotypes of some breeds were not 
explained by our predictive model, and many breeds have not been tested. It seems likely 
that many breeds have epistatic variation with chrl8 and X variants or have other variants 
that are less commonly associated with these traits across dog breeds. All of these issues 
can be addressed by studies of individual breeds. Since we have only found the common 



co-occurrence of the increased-fear/aggression variation on chrl8 and X in small and 
medium dogs, it will be interesting to see if this is also present in large dogs bred for 
aggression or fighting. 

Dogs were the first animals to be domesticated by humans. New studies indicate 
that dog domestication has a single origin in southern East Asia ~33,OOOYA, followed by 
migration to the Middle East, Africa and Europe ~15,000YA (46). Canine fear and 
aggression are of great interest because those traits - mainly the loss of fear of humans - 
are widely believed to underlie the mechanism for domestication (19). We thus 
considered whether the variation we identified at chrl8 and X could have been involved 
in the process of dog domestication. Our analysis of published genetic variation from 
extant wolf populations across the world shows that both fear/aggression alleles 
(protective on chrl8 and risk on chrX) are common in wolves - approximately 50% (31, 
32) and 43% (32), respectively. In domesticated dogs, 27 of 30 breeds in Figure 4 have 
the reduced fear/aggression chrl8 allele at a frequency of at least 75% (19/27 are fixed at 
a level of 100%). In contrast, only 10/27 breeds have the reduced-fear/aggression chrX 
allele at frequencies greater than that of wolves (starting at 68% allele frequency; 5 are 
fixed at a level of at least 95%). The common high-frequencies of the chrl8 reduced- 
fear/aggression allele across dog breeds are consistent with selection of reduced fear and 
aggression in the domestication of dogs. Interpretation of the chrX region seems less 
clear because the majority of breeds have the increased-fear/aggression allele. However, 
in parallel studies we show that the reduced fear/aggression allele has been under 
selection (i.e., its haplotype size is always large whereas the alternative allele is generally 
far smaller (21)). The high frequency of the four alleles at chrl8/X in extant wolf 
populations may seem counterintuitive in a model where ancestral wolves were generally 
more fearful and aggressive than domesticated dogs, but this could be explained by 
balancing selection or recent positive selection. 

A concrete understanding of dog domestication is expected soon as there is a 
major effort underway to sequence ancient dog genomes (47). Our prediction is that 
positive selection of the reduced-fear chrl8 and chrX variants was part of the 
domestication process. We further propose that humans have obscured those roots in 
modem dog breeds by selecting for increased aggression or for other linked 
morphological or behavioral traits (e.g., for short legs at chrl8 or for increased aggression 
or smaller size at chrX). We expect there are many other fear/aggression-associated loci 
that are more difficult to map across breeds. But it seems likely that having two very 
common alleles influencing these behaviors led to frequent selection between those to set 
a level of reactivity and disposition towards dogs and humans. In this regard, it seems 
plausible that the correlation between small size and the highest levels of fear/aggression 
is because the same behavior in large dogs is generally unacceptable to humans (36). In 
our principal mapping study, only small dogs - mainly Dachshund and Yorkshire Terrier 
- have high frequency of fear/aggression associated alleles at both chrl8 and X. The fact 
that dog and stranger oriented fear and aggression are generally much more strongly 
associated with chrl8 and X variants than with IGFH chrl5, HMGA2/chr\0 and 
IGF1R/ chr3 small-size variants further establishes that small size is not the predominant 



cause. However, it is also notable that a subset of large breeds carries the chrX reduced- 
fear/aggression and increased-size variants in perfect LD. 

The biochemistry and neuroanatomy of the emotions of fear and aggression are 
highly conserved in vertebrates, and some argue this is true across the animal kingdom 
(48). Across vertebrates the most immediate response to extreme threat involves the 
transmission of different sensory signals through the following sequence of brain regions 
(referred to as the low road): thalamus, amydgala, hypothalamus and pituitary gland, 
which sends nerve and hormonal signals to the adrenal glands, which in turn direct acute 
(through noradrenaline/adrenaline) and sustained (through glucocorticoid hormones such 
as cortisol) stress responses. This low road corresponds to innately programmed 
responses and is associated with emotions such as fear and anger. A parallel cognitive 
pathway that is referred to as the high road diverges at the thalamus, by then going to 
primary sensory and association centers in the cortex before continuing to the amygdala. 
Both pathways also involve bidirectional signaling with the hippocampus. Thus, while 
the immediate response to fear may be predominantly innate and emotional, it is not 
completely separate from cognition. There is extensive molecular and behavioral 
evidence that the hypothalamic-pituitary-adrenal (HPA) axis is the most critical driver of 
behavioral stress. Biochemical pathways implicated in social fear and aggression include 
signaling by serotonin and dopamine, and neuropeptides such as the predominantly- 
hypothalamic oxytocin and vasopressin (26). Notably, the domestication of another 
canid, the fox, resulted in foxes with greatly reduced HP A activity (20). After 45 
generations of selection for tameness in foxes, basal blood cortisol levels were reduced 
three-fold and stress-induced levels five-fold (compared to normal foxes). Domesticated 
foxes also have increased levels of brain serotonin, consistent with its inhibitory effect on 
aggression. New analysis of selective sweep regions associated with domestication of 
pigs showed that GNAT3 lies in one of the sweep regions of European (but not Asian) 
pigs (49); but it remains to be seen if GNAT3 variation is directly associated with that 
domestication event. Our findings that loci spanning GNAT3 (which is highly expressed 
in the amygdala) and 1GSF1 (highly expressed in the hypothalamus and pituitary gland) 
are associated with canine fear and aggression are thus consistent with a very large body 
of work implicating the HPA axis. 

It is clear from animal and human studies that fear and aggression are often 
associated, but it is not always in the same direction (50). Based on human behavior and 
pharmacology, the links between anxiety and aggression are very complex. Similarly, 
early life stresses in people and animals are associated in complex ways with anxiety 
disorders and aggression in adulthood. Early life stress generally involves changes in the 
HPA axis, and results in increased anxiety and altered social and aggressive behaviors. 
Animal models with a profile similar to the dog case presented here - in which both fear 
and aggression are elevated - are rare. Examples of knockout mouse models that have 
this property include those for enkephalin and a-calcium-calmodulin kinase II (51, 52). 
Selective breeding has also yielded strains that have increases in both anxiety and 
aggression. One of those is the North Carolina mouse, in which acute diazepam treatment 
reduces both anxiety and aggression (55). The other example is a strain of Novosibirsk 
Norway rats that was bred for increased aggression to humans (54). It is not immediately 



clear which human conditions may be most relevant to the present dog model at chrl8 
and X. Most likely those will include anxiety disorders. Notably, some anxiety conditions 
are associated with increased aggression, and this includes a subset of those affected by 
social anxiety. 

Now that we have identified common variants associated with fear and aggression 
across dog breeds, the segregation of these variants within breeds provide powerful 
models in which to dissect their biology at the levels of neuroanatomy, physiology and 
behavior. Among the areas of research on fear and aggression (48) that are ongoing in 
dogs and will be vastly accelerated by genetic handles are i) development and 
environmental-malleability (27, 55); ii) molecular/biochemical and imaging descriptions 
at baseline and under acute stress (20), iii) effects on mental and physiological states in 
the life course (24-26, 36), and iv) feasibility to mitigate negative effects through 
cognitive or pharmacological treatments (56, 57). In parallel, it will be important to 
determine the molecular mechanisms of these fear/aggression variants, and to identify 
their interactions with other genes and environmental factors. 


Materials and Methods 
Experimental design overview 

Since the analysis of dog breed data is highly vulnerable to population structure issues 
and false positive detection, we designed our analysis in two phases: first, a discovery 
phase, where available SNP data was used to map aggression and fear behavioral traits. 
We designed this phase to analyze two independent SNP datasets using two independent 
behavioral phenotypes for aggression and fear. Significant hits were taken then into a 
second phase for validation. To validate our findings we evaluated the performance of 
behavioral values predicted for breeds not included in the discovery phase. The 
expectation was to be able to predict behavioral traits from a few markers. 

Discovery Behavioral phenotypes 

Phenotype values for aggression and fear variables were taken directly from the C-BARQ 
values previously published for the top 30 most popular breeds of the AKC (22). We 
refer to this collection of behavioral phenotypes as “C-BARQ phenotypes”. This C- 
BARQ dataset is a collection of owner reported behavioral data of AKC registered dogs. 
Only the breeds for which SNP data (see SNP datasets section) were available were 
included in the analysis; therefore a total of 6,818 subjects were used to determine the 
phenotypical values. C-BARQ data decomposes aggression into 4 classes: Stranger- 
directed aggression (towards unfamiliar humans), Dog-directed aggression (towards 
unfamiliar dogs), Owner-directed aggression and Dog rivalry (towards familiar humans 
and dogs, respectively). In a similar way C-BARQ data decomposes fear into 5 classes: 
Stranger-oriented fear (towards unfamiliar humans), Dog-oriented fear (towards 
unfamiliar dogs), Nonsocial fear (towards environmental phenomena), Separation-related 
anxiety (being left alone by the owner) and Touch sensitivity. 


Validation Behavioral phenotypes 



To validate the hits mapped using the discovery data, we inferred predictions based on 
the markers detected for the breeds from the Vaysse dataset (75) that were not included in 
the top 30 most popular breeds (22). Our predicted C-BARQ values for 18 dog breeds 
(see phenotype prediction analysis section) were compared to C-BARQ phenotypes 
calculated from unreleased raw data (Serpell, unpublished). These C-BARQ phenotypical 
values were obtained from 2,130 subjects of 18 breeds. Only one breed (Greenland 
Sledge dog) included in the Vaysse dataset had no data available in the C-BARQ 
database and thus was excluded from the prediction analysis. 

SNP datasets 

Two previously published SNP datasets were used in this study. The first dataset 
contained -175,000 SNPs on the Illumina CanineHD array; we refer to this dataset as the 
“Vaysse dataset” (75). The second dataset contained -45,000 SNPs on the Affymetrix v.2 
Canine array; we refer to this dataset as the “Boyko dataset” (14). The Vaysse dataset 
contained a total of 456 subjects representing 30 dog breeds while the Boyko dataset 
contained 890 subjects representing 80 dog breeds. Since the stereotypic phenotype data 
was not available for all the breeds included in each of the datasets, only those for which 
phenotypes were available were kept; therefore, the Vaysse dataset contained 150 
subjects from 11 dog breeds while the Boyko dataset contained 327 subjects from 29 dog 
breeds. All dog breeds included in the Vaysse dataset used were also included in the 
Boyko dataset. Since the Vaysse dataset has higher resolution and cleaner signal as 
compared to the Boyko dataset (see original publications for more details) we designated 
the Vaysse dataset as our main discovery dataset while the Boyko dataset would be used 
for further validation of the findings detected on the Vaysse dataset. Both datasets are 
independent of each other and no subjects are shared between them. 

Genomewide Association Analysis and Mapping 

The preparation of datasets and subject removal were carried out in PLINK vl.07 (58). 
Principal component analysis evaluation was performed on GCTA v.1.24 (59). All 
association analysis were performed on GEMMA v.0.94.1 (28). Population structure was 
removed by using the centered relatedness matrix correction; the association tests were 
performed using the Univariate Linear Mixed Model using the Likelihood Ratio Test. 
Genomewide significance was declared for the Vaysse dataset for a P-value equal or less 
than 1 x 10~ 8 ; for the Boyko dataset, genomewide significance was declared for a P-value 
equal or less than 1 x 10' 5 . GEMMA was run on the Ohio Supercomputer Center’s 
Oakley Cluster (www.osc.edu) for faster processing. To avoid irreproducibility issues, no 
dataset trimming or LD clustering were performed on the SNP data. All Manhattan plots 
were generated by SAS v.9.3 from GEMMA outputs. Genomewide significant hits were 
mapped on the UCSC Genome Browser (60) but coordinates were lifted to take 
advantage of the enhanced annotation available from the Broad Institute CanFam3 
Improved Annotation Data V.l (61) since the original SNP coordinates provided were 
CanFam2. 


Haplotype analysis of genomewide significant hits 



LD blocks of adjacent top significant hits and haplotype determination were evaluated by 
Haploview v.4.2 (62). Only top hits were included in the haplotype determination. For 
hits within the X chromosome all subjects were deemed as females since the Vaysse 
dataset is not annotated by sex. 

Phenotype Prediction Analysis 

Significant hits detected across the Vaysse and Boyko datasets for each trait in the 
discovery phase were further evaluated for their predictive performance. For this part, 
allele frequencies for the top significant hit for all dog breeds used in the discovery phase 
and dog breeds not included in the discovery phase (prediction phase breeds) were 
calculated using PLINK vl.07. Each of the significant markers’ allele frequencies were 
linearly regressed using a stepwise forward selection method based on an 
inclusion/exclusion alpha cutoff of < 0.05 excluding the intercept. All statistical modeling 
was performed on SAS v9.3. For each iteration, the selection process tested if any of the 
allele frequency markers would significantly contribute to the model in terms of P-value 
significance; if the marker became non-significant after the addition of another marker to 
the model then it was removed from the model. The final iteration contained all markers 
that were below the alpha inclusion/exclusion criteria. Once the equation parameters were 
estimated, predictions were made using the beta coefficients for each marker that 
remained in the model with the allele frequencies obtained from the prediction phase 
breeds. By this method, a prediction value for each of the traits for every breed evaluated 
was generated. 

To evaluate the performance of the aggression and fear predictions made, we 
calculated mean C-BARQ values observed from the unreleased raw database and 
calculated 95% confidence intervals for each. Sample size and standard errors used to 
estimate each of the confidence intervals corresponded to the sample size/standard errors 
of each breed from the unreleased raw dataset for the respective trait. Since the C-BARQ 
values are contained within a fixed scale that goes from 0 to 4, a uniform distribution was 
assumed to calculate the cumulative probability associated to each confidence interval 
range. With this, it was possible to estimate the chance of making a correct prediction by 
a random guess. We calculated the difference between the observed and our predicted 
value and, if the difference was contained within the confidence interval of the observed 
value we considered the prediction a success, while in the case where the predicted value 
was outside of the confidence interval, it was considered a failure. To determine if the 
performance of our prediction outcome was superior to a random guess outcome, we 
tested using a one-tailed Fishers Exact test if the success /failure counts of our prediction 
outcome were superior to the random guess outcome. Since our models were generated 
independent of each other, we did not explore the prediction efficiency scenario where a 
trait is evaluated given that another trait or traits estimates are known. In summary, we 
evaluated if the chance of correctly predicting the set of values by random guessing was 
comparable to the predictions made using our models based on the genomewide 
significant markers for each trait. 
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Fig. 1: PCA analysis of behavioral traits. Since the two GWA datasets only partially 
overlapped in breed content, PCA analysis was performed to evaluate if the breed 
makeup affects the distribution structure of the variables. (A, B) Component pattern and 
component scores, respectively, of C-BARQ behavioral traits on the matching breeds in 
the Boyko dataset. (C, D) Component pattern and component scores of C-BARQ 
behavioral traits on the matching breeds in the Vaysse dataset. 

Fig. 2: Manhattan plots of C-BARQ aggression traits. Vertical lines indicate relevant 
and consistent hits across the two GWA datasets. In each panel, the top plot corresponds 
to the Vaysse dataset while the bottom plot corresponds to Boyko dataset. (A) Stranger- 
directed aggression. (B) Dog-directed aggression. (C) Owner-directed aggression. (D) 
Dog rivalry. 

Fig. 3: Manhattan plots of C-BARQ fear traits. Vertical lines indicate relevant and 
consistent hits across the two GWA datasets. In each panel, the top plot corresponds to 
Vaysse dataset while the bottom plot corresponds to Boyko dataset. (A) Stranger-oriented 
fear. (B) Dog-oriented fear. (C) Nonsocial-oriented fear. (D) Separation-related anxiety. 
(E) Touch-sensitivity. 

Fig. 4: Haplotype distribution across dog breeds based on novel chrl8 and X 
markers associated with aggression and fear. Haplotypes were defined only on the 
alleles of the top markers detected in this study. Allele distributions are color coded on a 



gradient that goes from yellow (fixed for allele 1) to blue (fixed for allele 2). Allele label 
letters are arbitrary. 


Fig. 5: Success/failure matrix of predicted values. Green fill indicates a successful 
prediction and red is a failed prediction. Columns correspond to aggression and fear traits 
while rows correspond to dog breeds predicted. Totals are the sum of successful 
prediction within the column/row; columns and rows are sorted in numerical order and 
cells have a fill color gradient that goes from red (worse) to green (best). 

Fig. 6: Linkage Disequilibrium plots for chrl8/X fear and aggression loci. (A) Chrl8 
GWA locus for fear/aggression (using C-BARQ phenotypes and the Vaysse genotype 
dataset). (B) ChrX locus for the same GWAS as A. Genomic coordinates are converted 
from CanFam2 to CanFam3, and gene information is from the Broad Institute’s CanFam3 
Improved Annotation Data v. 1. Linkage disequilibrium plot was created with Haploview 
v.4.2. 
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Table 1. Summary of canine fear and aggression GWAS results 

Fear and aggression trait peaks are given for separate GWAS studies using Vaysse (marked "V"; Illumina 
HD) and Boyko ("B"; Affymetrix v.2) genotype datasets. Loci shared with both are black and others are 
gray. Coordinates use CanFam2 assembly. 

14 The peak SNPs chrX: 105,245,495, chrX: 105,770,058, chrX: 105,877,339, and chrX: 106,189,665 
(numbered 1-4 in superscript, respectively) lie within one LD block. At least SNPs 2 and 3 are presumed to 
implicate the same haplotype/functional variant; candidate genes refer to these peaks. 

5 The peak SNP is 23,298,242 (vs. chrl8:23,260,370 for the others). 

6 The peak SNP for Vaysse is chrlO: 11,169,956 and for Boyko is chrlO: 10,859,628. 

7 Vaysse peak SNP chrl5:44,258,017; Boyko peak SNP chrl5:44,226,659 
8 Peak SNP is a coding variant at a generally mammalian-conserved position. 

9 ARHGAP36, IGSF1 and long non-coding RNA FIRRE are co-expressed, including in the pituitary gland 
and hypothalamus (see text). 
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C-BARQ Trait 

C-BARQ Trait Description 1 
(number of questionnaire items) 


Stranger-directed 

Severity of threatening or aggressive 
responses to strangers approaching or 
invading the dog’s or owner’s personal 
space, territory, or home range (10) 


Dog-directed 

Severity of threatening or aggressive 
responses when approached directly by 
unfamiliar dogs (4) 

Aggression 

Owner-directed 

Severity of threatening or aggressive 
responses to the owner or other members of 
the household when challenged, manhandled, 
stared at, stepped over, or when approached 
while in possession of food or objects (8) 


Dog Rivalry 

Severity of aggressive or threatening 
responses to other familiar dogs in the 
household (4) 


Stranger-oriented 

Severity of fearful or wary responses when 
approached directly by strange or unfamiliar 
people (4) 


Dog-oriented 

Severity of fearful or wary responses when 
approached directly by unfamiliar dogs (4) 


Nonsocial 

Severity of fearful or wary responses to 
sudden or loud noises, traffic, and unfamiliar 
objects and situations (6) 

Fear 

Separation-related 
behavior (anxiety) 

Frequency of vocalizing and/or destructive 
behavior when separated from the owner, 
including autonomic signs of anxiety— 
restlessness, loss of appetite, trembling, and 
excessive salivation (8) 


Touch Sensitivity 

Severity of fearful or wary responses to 
potentially painful or uncomfortable 
procedures, including bathing, grooming, 
nail-clipping, and veterinary examinations 

(4) 


Table SI. C-BARQ fear and aggression trait descriptions (numbers of 
questionnaire items in parentheses) 




Breed 

Trait 

N 

Mean 

StDev 

Var 

BeT 

DogAggr 

110 

0.908 

0.814 

0.663 

BeT 

DogFear 

110 

0.616 

0.821 

0.674 

BeT 

NonFear 

112 

0.616 

0.646 

0.417 

BeT 

OwnAggr 

112 

0.118 

0.281 

0.079 

BeT 

RivalryAg 

107 

0.635 

0.632 

0.399 

BeT 

SepFear 

112 

0.364 

0.493 

0.243 

BeT 

StrAggr 

112 

0.640 

0.542 

0.293 

BeT 

StrFear 

111 

0.632 

0.889 

0.791 

BeT 

TouchFear 

111 

0.501 

0.480 

0.231 

BMD 

DogAggr 

195 

0.506 

0.676 

0.456 

BMD 

DogFear 

194 

0.500 

0.689 

0.474 

BMD 

NonFear 

195 

0.622 

0.678 

0.460 

BMD 

OwnAggr 

195 

0.061 

0.195 

0.038 

BMD 

RivalryAg 

185 

0.250 

0.544 

0.296 

BMD 

SepFear 

194 

0.261 

0.403 

0.162 

BMD 

StrAggr 

194 

0.349 

0.397 

0.158 

BMD 

StrFear 

196 

0.619 

0.905 

0.818 

BMD 

TouchFear 

193 

0.392 

0.469 

0.220 

BoC 

DogAggr 

640 

1.068 

0.990 

0.980 

BoC 

DogFear 

649 

0.846 

0.863 

0.746 

BoC 

NonFear 

650 

0.965 

0.792 

0.627 

BoC 

OwnAggr 

659 

0.135 

0.362 

0.131 

BoC 

RivalryAg 

588 

0.667 

0.810 

0.656 

BoC 

SepFear 

661 

0.472 

0.615 

0.378 

BoC 

StrAggr 

649 

0.596 

0.678 

0.460 

BoC 

StrFear 

647 

0.731 

0.940 

0.884 

BoC 

TouchFear 

643 

0.750 

0.790 

0.624 

BoT 

DogAggr 

70 

1.289 

1.125 

1.265 

BoT 

DogFear 

72 

0.981 

0.932 

0.868 

BoT 

NonFear 

70 

0.716 

0.677 

0.459 

BoT 

OwnAggr 

72 

0.097 

0.209 

0.044 

BoT 

RivalryAg 

60 

0.281 

0.633 

0.401 

BoT 

SepFear 

2 

0.520 

0.532 

0.283 

BoT 

StrAggr 

71 

0.405 

0.462 

0.214 

BoT 

StrFear 

72 

0.524 

0.845 

0.713 

BoT 

TouchFear 

71 

0.720 

0.686 

0.470 

BrS 

DogAggr 

111 

0.673 

0.778 

0.606 

BrS 

DogFear 

109 

0.664 

0.764 

0.583 

BrS 

NonFear 

112 

0.572 

0.498 

0.248 

BrS 

OwnAggr 

116 

0.094 

0.249 

0.062 

BrS 

RivalryAg 

97 

0.353 

0.509 

0.259 

BrS 

SepFear 

115 

0.661 

0.617 

0.381 

BrS 

StrAggr 

115 

0.438 

0.481 

0.232 


BrS 

StrFear 

BrS 

TouchFear 

Elk 

DogAggr 

Elk 

DogFear 

Elk 

NonFear 

Elk 

OwnAggr 

Elk 

Rivalry Ag 

Elk 

SepFear 

Elk 

StrAggr 

Elk 

StrFear 

Elk 

TouchFear 

ESt 

DogAggr 

ESt 

DogFear 

ESt 

NonFear 

ESt 

OwnAggr 

ESt 

RivalryAg 

ESt 

SepFear 

ESt 

StrAggr 

ESt 

StrFear 

ESt 

TouchFear 

Eur 

DogAggr 

Eur 

DogFear 

Eur 

NonFear 

Eur 

OwnAggr 

Eur 

RivalryAg 

Eur 

SepFear 

Eur 

StrAggr 

Eur 

StrFear 

Eur 

TouchFear 

FSp 

DogAggr 

FSp 

DogFear 

FSp 

NonFear 

FSp 

OwnAggr 

FSp 

RivalryAg 

FSp 

SepFear 

FSp 

StrAggr 

FSp 

StrFear 

FSp 

TouchFear 

GoS 

DogAggr 

GoS 

DogFear 

GoS 

NonFear 

GoS 

OwnAggr 

GoS 

RivalryAg 

GoS 

SepFear 


116 

0.460 

0.672 

0.451 

116 

0.646 

0.673 

0.453 

33 

0.725 

0.887 

0.787 

33 

0.606 

0.862 

0.742 

33 

0.631 

0.644 

0.414 

33 

0.225 

0.518 

0.268 

27 

0.241 

0.329 

0.108 

33 

0.331 

0.494 

0.244 

33 

0.605 

0.738 

0.544 

33 

0.318 

0.567 

0.321 

32 

0.690 

0.695 

0.483 

76 

0.602 

0.827 

0.684 

74 

0.720 

0.877 

0.770 

76 

0.490 

0.562 

0.316 

78 

0.115 

0.317 

0.101 

67 

0.578 

0.722 

0.521 

78 

0.423 

0.577 

0.333 

78 

0.316 

0.345 

0.119 

78 

0.327 

0.538 

0.289 

75 

0.421 

0.588 

0.346 

45 

0.461 

0.538 

0.290 

48 

0.293 

0.323 

0.105 

48 

0.453 

0.420 

0.176 

47 

0.118 

0.358 

0.128 

44 

0.470 

0.718 

0.515 

48 

0.271 

0.435 

0.189 

48 

0.382 

0.474 

0.225 

45 

0.517 

0.728 

0.530 

47 

0.576 

0.568 

0.322 

10 

1.067 

0.700 

0.489 

10 

0.708 

0.429 

0.184 

10 

0.920 

0.658 

0.433 

10 

0.101 

0.277 

0.077 

7 

1.214 

0.847 

0.717 

10 

0.313 

0.531 

0.282 

10 

0.350 

0.409 

0.167 

10 

0.500 

0.874 

0.764 

10 

0.700 

0.550 

0.303 

27 

0.784 

0.784 

0.614 

27 

0.657 

0.896 

0.804 

27 

0.618 

0.556 

0.309 

27 

0.061 

0.107 

0.011 

26 

0.635 

0.686 

0.471 

27 

0.482 

0.717 

0.514 



GoS 

StrAggr 

GoS 

StrFear 

GoS 

TouchFear 

Gry 

DogAggr 

Gry 

DogFear 

Gry 

NonFear 

Gry 

OwnAggr 

Gry 

RivalryAg 

Gry 

SepFear 

Gry 

StrAggr 

Gry 

StrFear 

Gry 

TouchFear 

IrW 

DogAggr 

IrW 

DogFear 

IrW 

NonFear 

IrW 

OwnAggr 

IrW 

RivalryAg 

IrW 

SepFear 

IrW 

StrAggr 

IrW 

StrFear 

IrW 

TouchFear 

JRT 

DogAggr 

JRT 

DogFear 

JRT 

NonFear 

JRT 

OwnAggr 

JRT 

RivalryAg 

JRT 

SepFear 

JRT 

StrAggr 

JRT 

StrFear 

JRT 

TouchFear 

NFd 

DogAggr 

NFd 

DogFear 

NFd 

NonFear 

NFd 

OwnAggr 

NFd 

RivalryAg 

NFd 

SepFear 

NFd 

StrAggr 

NFd 

StrFear 

NFd 

TouchFear 

NSD 

DogAggr 

NSD 

DogFear 

NSD 

NonFear 

NSD 

OwnAggr 

NSD 

RivalryAg 


26 

0.450 

0.502 

0.252 

27 

0.444 

0.864 

0.747 

27 

0.521 

0.528 

0.278 

138 

0.536 

0.798 

0.636 

139 

0.758 

0.864 

0.747 

141 

1.007 

0.849 

0.721 

143 

0.132 

0.261 

0.068 

120 

0.526 

0.628 

0.394 

138 

0.629 

0.701 

0.491 

138 

0.138 

0.322 

0.104 

139 

0.698 

1.077 

1.159 

139 

0.690 

0.759 

0.577 

45 

0.541 

0.974 

0.948 

44 

0.472 

0.791 

0.626 

45 

0.660 

0.679 

0.461 

45 

0.045 

0.094 

0.009 

38 

0.423 

0.758 

0.574 

45 

0.391 

0.444 

0.197 

45 

0.210 

0.235 

0.055 

45 

0.411 

0.745 

0.554 

44 

0.511 

0.604 

0.365 

284 

1.477 

1.142 

1.304 

279 

0.826 

0.906 

0.821 

285 

0.930 

0.764 

0.584 

291 

0.322 

0.547 

0.299 

242 

0.983 

0.958 

0.918 

291 

0.707 

0.668 

0.446 

286 

0.799 

0.770 

0.592 

289 

0.575 

0.837 

0.701 

279 

0.833 

0.715 

0.511 

45 

0.674 

0.782 

0.611 

48 

0.462 

0.653 

0.426 

49 

0.712 

0.643 

0.413 

49 

0.131 

0.355 

0.126 

42 

0.448 

0.603 

0.364 

49 

0.455 

0.431 

0.186 

49 

0.350 

0.393 

0.154 

49 

0.199 

0.635 

0.404 

49 

0.617 

0.611 

0.373 

66 

0.607 

0.716 

0.512 

66 

0.525 

0.658 

0.434 

66 

0.701 

0.672 

0.452 

67 

0.091 

0.208 

0.043 

62 

0.496 

0.612 

0.374 



NSD 

SepFear 

67 

0.465 

0.496 

0.246 

NSD 

StrAggr 

65 

0.299 

0.354 

0.125 

NSD 

StrFear 

66 

0.451 

0.603 

0.364 

NSD 

TouchFear 

67 

0.657 

0.681 

0.464 

Sci 

DogAggr 

27 

1.016 

0.921 

0.848 

Sci 

DogFear 

27 

0.633 

0.681 

0.464 

Sci 

NonFear 

27 

0.595 

0.549 

0.301 

Sci 

OwnAggr 

27 

0.186 

0.347 

0.120 

Sci 

RivalryAg 

26 

0.667 

0.759 

0.577 

Sci 

SepFear 

26 

0.351 

0.385 

0.148 

Sci 

StrAggr 

26 

0.764 

0.717 

0.513 

Sci 

StrFear 

26 

0.538 

0.673 

0.453 

Sci 

TouchFear 

26 

0.788 

0.848 

0.719 

ShP 

DogAggr 

37 

1.392 

1.008 

1.016 

ShP 

DogFear 

36 

0.701 

0.795 

0.632 

ShP 

NonFear 

38 

0.735 

0.760 

0.577 

ShP 

OwnAggr 

40 

0.238 

0.504 

0.254 

ShP 

RivalryAg 

26 

0.772 

0.753 

0.567 

ShP 

SepFear 

40 

0.699 

0.818 

0.670 

ShP 

StrAggr 

40 

1.029 

0.866 

0.750 

ShP 

StrFear 

38 

0.961 

0.907 

0.823 

ShP 

TouchFear 

39 

1.077 

1.081 

1.169 

Wei 

DogAggr 

107 

0.805 

0.754 

0.568 

Wei 

DogFear 

109 

0.878 

0.792 

0.628 

Wei 

NonFear 

107 

0.612 

0.669 

0.448 

Wei 

OwnAggr 

112 

0.123 

0.401 

0.160 

Wei 

RivalryAg 

97 

0.490 

0.654 

0.427 

Wei 

SepFear 

111 

0.799 

0.845 

0.714 

Wei 

StrAggr 

109 

0.641 

0.683 

0.467 

Wei 

StrFear 

111 

0.585 

0.776 

0.602 

Wei 

TouchFear 

109 

0.616 

0.687 

0.473 


Table S2. C-BARQ behavioral phenotypes used to 
test predictive model 

1 Breed abbreviations from Vaysse et al. (i) 
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A 


Y = ij+ pi(chrA:MAF) + /32(chrB:MAF)+... 


B Risk of negative behavior 

-► 

■ Lower score _ ( _ Higher score ■ 

I jj * I 

pi(chr18:MAF) 

◄- 

p2(chrX:MAF) 

c 

Stranger Aggression Score = 0.6857 + 0.73785(chr18:MAF) -0.24217(chrX:MAF) 
Dog Aggression Score = 0.91511 + 0.74044(chr18:MAF) 

Owner Aggression Score = 0.062179+ 0.25121(chr15:MAF) 

Rivalry Score = 0.42109 + 0.353678(chr15:MAF) 

Stranger Fear Score = 0.63331 + 0.80863(chr18:MAF) -0.29395(chrX:MAF) 

Dog Fear Score = 0.66267 + 0.58023(chr18:MAF) 

Nonsocial Fear Score = 0.67135 + 0.52984(chr18:MAF) 

Separation Related Anxiety Score = 0.6068 + 0.39148(chr18:MAF) -0.23978(chrX:MAF) 
Touch Sensitivity Score = 0.5625 + 0.62219(chr18:MAF) 


Fig. SI. Summary of prediction equation. (A) Generalized equation of the model used 
for the prediction of behavioral traits. MAF corresponds to the Major Allele Frequency of 
the SNP marker evaluated. (B) Interpretation example of the equation parameters for the 
prediction of negative behavior, p sets the overall mean, pi(chrl8:MAF) increases the 
risk while p2(chrX:MAF) decreases the risk. Therefore, a dog breed with high risk for 
negative behavior has a high chrl8:MAF and a low chrX:MAF; on the other side, a dog 
breed with low risk for negative behavior has a low chrl8:MAF and a high chrX:MAF. 
This specific example applies directly to Stranger-directed aggression, Stranger-oriented 
fear and Separation-related anxiety. (C) Regression equations for each of the nine C- 
BARQ behavioral traits. 
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One Sentence Summary: Analysis of genome scans, positive selection and biological 
relevance implicate GNAT3 and IGSF1 in canine fear and aggression. 


Abstract: 

While extensive linkage disequilibrium (LD) in dogs facilitates broad mapping, it is a 
formidable obstacle for fine mapping. We combine data from interbreed genome scans 
and intrabreed signatures of positive selection to fine-map dog traits. After demonstrating 
the concept with two known canine-trait loci, we analyzed two new genome loci 
associated with canine fear and aggression. We thus implicated GNAT3 (chrl8) and 
IGSF1 (chrX) to be the top genes associated with fear and aggression in dogs. Our ability 
to fine map the X chromosome variant is striking because it lies within a ~3Mb LD block 
that has the highest signal of selection in dogs and also contains association peaks for 
increased size (which is in perfect LD with the decreased-fear/aggression allele in 9/11 
tested breeds) and sociability (which we suggest is a distinct trait associated with 
HS6ST2 ). The patterns of brain expression of GNAT3 and IGSF1 indicate they are 
excellent candidates for involvement in the fear pathway centered on the amygdala/ 
hypothalamic-pituitary-adrenal axis. 


Main Text: 

Introduction 

The dog is an extremely powerful genetic model in unique ways (i, 2). Despite earlier 
divergence time with humans than mouse, dogs are more similar to humans at the levels 
of DNA and protein sequence - and share 600Mb of DNA sequence that was lost in mice 
(i, 3). Dogs age 5-7 times faster than humans, are numerous (~75M in the US), receive 
health care and have several hundreds of characterized inherited diseases. But most 
importantly, dogs represent a one-of-a kind genetics resource. There are approximately 



400 dog breeds and each is on the order of 100-fold simpler than the full population. 
Within breeds, variation is far less than within the major human ethnic groups; but across 
breeds, variation is much greater than across human populations. Dogs are thus powerful 
for broad mapping in genome scans, but have two potential limitations: i) fine mapping is 
difficult due to expansive linkage disequilibrium, or LD (on the order of 50-fold greater 
than in humans ( 4 )); and ii) some traits cannot be mapped by classical genetic approaches 
within single breeds because each breed is fully or nearly homozygous over 
approximately 25% of the genome (5). Successful studies have mitigated those problems 
by first broad mapping across breeds and, after discovery of additional variation in the 
region of interest, fine-mapping within multiple related breeds that are known to share a 
founder variant. (6) This approach has in part given way to the use of increased marker 
densities - currently -175,000 single nucleotide polymorphism (SNP) markers - to 
enable finer mapping in a single study. But the high levels of LD in dogs continue to be a 
major limitation to fine mapping. In such cases, experimental screening of candidates 
may not be feasible and the only recourse may be prioritization based on variant analysis 
and biological relevance (7). 

New studies indicate that there was a single origin of dog domestication from 
wolves approximately 33,000YA in southern East Asia; and this was followed by 
migration to the Middle East, Africa and Europe, beginning about 15,000YA ( 8 ). The 
evolutionary history of dogs has led several investigators to search for evidence of 
selection across the genome (as has been done in humans (9)) (5, 10-13). For instance, 
Akey et al. identified a very large region of population differentiation in the Shar-Pei and 
showed that it is associated with its unique trait of wrinkled skin. Similar patterns of 
selection have also been reported in cases where a breed stereotype is associated with a 
fully penetrant haplotype that is often fixed for the risk or non-risk allele within any 
breed (5, 12). The landmark work by Boyko et al. and Vaysse et al. represent the most 
comprehensive studies of stereotype mapping and population genetics across breeds. The 
former included 915 dogs from 80 dog breeds and the latter 509 dogs from 46 breeds; 
there was partial breed overlap in the two (these genotype resources are the basis for the 
present study). Vaysse et al.’s article is titled “Identification of genomic regions 
associated with phenotypic variation between dog breeds using selection mapping” (5). In 
addition to genomewide association (GWA) mapping of boldness, sociability and several 
morphological breed-stereotypes, they conducted extensive discovery of evolutionary 
footprints of selection. That included population differentiation using the fixation index 
(F S t; when several breeds share founder variation) and D, (designed to detect selection in 
one or few breeds out of a larger group; pairwise Fst values are normalized for a breed 
vs. the genomewide average, then summed across pairwise combinations involving that 
breed (10)), extended haplotype homozygosity (EHH, or cross-population EHH; the 
probability of identity by descent is calculated using all genotypes in a dataset (9)), and 
reduced heterozygosity (5,; the sum of regional deviations in levels of genomewide 
relative-heterozygosity between two breeds is compared to the genomewide average, and 
the sum of those across all pairwise comparisons is calculated (5)). Of Vaysse et al.’s top 
18 Fst regions, 10 overlapped a total of 15 GWA loci that were previously published 
(n=l 1) or new (n=4). The top two F S t regions are approximately 2.5 and 2Mb long, and 



each contains at least three known trait associations that are distinct according to 
haplotype analysis. Boyko et al. had similar result, with seven of the top 10 Fst regions 
being the same in both studies. In addition, Vaysse et al. identified extensive evidence of 
selective sweeps that may be specific to one or a few breeds, which could be missed by 
F S t analysis. Several regions of high differentiation (D,) and reduced heterozygosity (S l ) 
correspond to known stereotype-associated haplotypes (e.g., skin wrinkling, 
chondrodysplasia and furnishings), but the vast majority of those regions have not been 
linked to any trait. 

Genetic hitchhiking generally refers to neutral variation that is carried along with 
an allele under positive selection (14). As the selected variant goes to fixation, there is a 
loss of variation flanking that site (termed a selective sweep) (15, 16). LD in such a 
selected region increases dramatically, and that effect is the basis of several approaches 
for identifying positive selection (e.g., EHH, which detects large haplotypes suggestive of 
selective sweeps). In contrast to the early LD effects, as the functional allele approaches a 
frequency of 50%, a distinct pattern emerges: LD on one side of the selected allele will 
remain high and decrease slowly, but LD across the selected site decreases quickly (17, 
18). Because dog breeds represent hundreds of isolated populations, they are an ideal 
species in which to use hitchhiking patterns of LD to fine-map variation that underwent 
selection. Some breeds should have strong LD only upstream of the selected site, and 
others only downstream. Although hitchhiking theory is rarely mentioned there, its 
signature of extended LD can be seen in several reports where one or few breeds exhibit a 
very large block of homozygosity. Most of those loci were first mapped by GWA to some 
trait (e.g., chondrodysplasia and furnishings), with the exception of wrinkled skin in the 
Shar-Pei (10, 19, 20). Here we directly test for evidence for and against selection at 
canine peak GWA-SNPs using the conventional approaches of phasing or identity by 
descent (IBD) haplotype analysis. These have been used in human genetics in the 
contexts of population genetics and IBD mapping (21). Until the advent of high 
throughput sequencing, limited human haplotype-phasing was implemented through triad 
analysis or direct experimental methods. Rather, most of that work was and continues to 
be done through probabilistic methods. 

We show that haplotype phasing can be done within breeds by separately 
analyzing all dogs homozygous for each allele at any position (and determining the extent 
of flanking homozygosity). We first demonstrate the use of D„ S) and haplotype phasing 
analysis applied to evaluate previously published GWA loci, and then use it to fine map 
variation identified by GWA of canine fear and aggression traits in two cohorts with 
partially overlapping breeds (22). Specifically, single loci on chrl8 and chrX are 
associated with stranger-directed fear, dog-directed fear, stranger-directed aggression, 
dog-directed aggression, touch-sensitivity and non-social fear; and the chrX locus is also 
associatated with separation anxiety. Of those traits, dog-directed fear, touch sensitivity, 
non-social fear and separation anxiety are also associated with known variants at IGF1, 
IGF1R or HMGA2 that are associated with small size. However, in contrast to the chrl8 
and X variants, those associations had a negligible effect in predictive models that were 
validated in a third group of dogs from different breeds. Rather, small size variation is 
strongly predictive for owner-directed aggression and dog rivalry (which are not strongly 



associated with the chrl8 and X loci). By combining that chrl8 and X GWA data with 
multiple types of evidence of selective sweeps and the gene expression patterns of a total 
of five genes at those loci, we implicate two genes in the core neuroanatomical circuit 
that drives fear and aggression - the Hypothalamic-Pituitary-Adrenal (HPA) Axis. 

Results 

Signatures of positive selection 

The present studies were devised to fine map two loci from our GWA study of canine 
fear and aggression traits (22). Our mapping was done using the published breed 
behavioral stereotypes from the C-BARQ resource and the genotype datasets of Boyko et 
al. and Vaysse et al. (5, 12, 23). The latter studies have partial overlap in breed makeup; 
Boyko’s has more breeds and dogs (genotyped on the Affymetrix v.2 Canine array), and 
Vaysse’s has higher SNP density (Illumina CanineHD array). Both of our primary fear 
and aggression loci lie among the highest Fst regions in dogs. The top Fst region in both 
studies, on chrX, includes one of our GWA peaks. That interval was said to be 3.7Mb 
(0.795 Fst) and 2.6Mb (0.75 Fst) long, respectively (5, 12). Boyko et al. reported this 
region is associated with body size and many aspects of skull shape (both absolute and 
proportional; distinct from brachycephaly), but these were only reported as r 2 correlation 
values for the general chrX: 104Mb region (without mention of peak SNPs or p-values). 

In the present work, we confirmed this chrX association with size, but using the Vaysse 
genotype dataset (see Methods; (5)), and show the location of the peak SNP. Vaysse et al. 
also mapped sociability within this chrX Fst interval. The region is thus consistent with 
the increased LD that is associated with a selective sweep, but interpretation and fine 
mapping are complicated by the reduced recombination rate on chrX. The other fear and 
aggression locus lies on the chrl8 Fst region that ranks fifth in Boyko’s analysis, but is 
absent using the same thresholds in Vaysse {Fst >0.55; minor allele frequency >15%). 
This single-SNP F S t site tags, by linkage, the FGF4 retrogene insertion that causes 
chondrodysplasia in several breeds (but some breeds have that SNP allele/ancestral 
haplotype without the insertion). Because it is absent in Vaysse’s F S t regions (which has 
higher SNP density), this indicates that our fear/aggression GWA hit in both datasets is 
unlikely to be a false positive due to shared population structure in the two cohorts. We 
found no report of attempts to use genetics or biology to identify the functional variation 
on the chrX locus. This seems surprising given that it is the top F S t locus across different 
samples of diverse dog breeds and that it is the first known locus associated with large 
size and an important behavioral trait (sociability). By first appearances the region seems 
impenetrable by fine-mapping due to strong and extensive LD. The sociability trait 
(which was only significant in Vaysse’s analysis of female dogs) maps to a peak of 10 
SNPs spanning 580kb. 

We first conducted analysis of known GWA haplotypes for signatures of ancient 
and recent selection. The small-size functional variant in IGF1, which has the strongest 
contribution to dog size across all but very large dog breeds, was indicated to predate dog 
domestication (10, 11). Figure 1A shows contiguous SNPs in the Vaysse dataset centered 
on the GWA peak SNP for small size that lies within IGF1 on chrl5 (5). Thirteen of the 
28 breeds have D, regions that intersect the peak SNP, and those represent both small and 



large breeds. The strong differentiation signal is consistent with Boyko and Vaysse 
detecting this locus as the fourth and fifth top Fst region, respectively. 5, regions from 
two breeds intersect the tag marker, and those from five others comprise a small block 
upstream. We interpret this observation to mean that the reduced variation associated 
with the selective sweep is negligible because the selection went to fixation so long ago. 
The question remains why seven non-small breeds share a block of 5, shortly upstream of 
the small allele (e.g., this could reflect selection of another functional variant at this 
locus). We also phased haplotypes in individual breeds by selecting all homozygotes of 
each allele at the peak SNP and then separately measuring the length of homozygosity 
across all individuals. Put another way, we directly determined phasing rather than using 
probabilistic methods. The pattern of phased haplotypes suggests the T allele haplotype 
that underwent selection for small size long ago is still generally longer than the 
alternative allele’s haplotype (18 and 15 kb, respectively). The Rottweiler was previously 
shown to be anomalous because it has a high frequency the small dog allele, but is in fact 
very large (24); to us this is consistent with a second variant strongly compensating for 
the small dog variant. All three tests indicate signal in the Rottweiler, including the 
longest Si region and phased haplotype of all the breeds. Curiously, all eight breeds with 
large phased haplotypes overlap the upstream seven-breed block of Si. All of those 5, 
breeds for which we have phasing (5 of the 7) have large phased haplotypes, and that 
includes the four largest - which range from about 200-400kb. 

Next we looked at the furnishings variant within RSP02 on chrl3, which was 
selected in different breeds relatively recently (Fig. IB) (20). Five breeds carry the peak 
GWA SNP associated with furnishings, all of which are known to have the phenotype 
(furnishings on Jack Russell Terrier is evident in the long coated version recognized by 
the US American Kennel Club - called Parson Russell Terrier). Of those five breeds, four 
have Di signal spanning the peak SNP (a fifth is shifted ~50kb upstream) and two of 
those same four have 5, signal (two others are shifted ~20 and 70kb). [Only one non¬ 
furnishings breed has overlapping D, signal and it is the smallest Di interval of all, and 
none has 5, signal.] Phasing analysis shows the haplotype containing the selected allele is 
much larger than the alternative allele’s (range of 186-1,369 vs. 7.5-43kb; minimal 
overlapping intervals of 186 vs. 7.5kb). To contrast these findings for anciently and 
recently selected alleles, we evaluated polymorphic alleles that are not well implicated by 
signatures of selection (Supplementary Fig. 1). These show significant differences in 
patterns and range of sizes of phased haplotypes. Thus, future studies are necessary to 
consider the feasibility and utility of genome scanning for measures of positive selection 
including phased haplotype analysis. 

In Figure 2 we present analysis of the new fear and aggression locus on chrl8 
(22). The haplotype phasing analysis shows the two breeds with the increased risk allele 
- Dachshund and Yorkshire Terrier - have much larger haplotypes compared to the 
alternative allele (ranging up to 683 vs. 186; minimal overlap regions of 418 vs. 13.2 kb). 
Their haplotype sizes, and the central position of the putative variant under selection, 
suggest a recent selective sweep according to hitchhiking theory. Notably, only a subset 
of the increased-fear/aggression haplotype under selection contains the FGF4 retrogene 
insertion that causes chondrodysplasia in some breeds but not in others. All known 



exceptions to the co-occurrence of the retrogene insertion and the phenotype are very 
small dogs - Yorkshire Terrier, Chihuahua and Japanese Chin - leading the authors to 
propose the trait is not manifest in carriers that are small (19). [That study included four 
Yorkshire terriers that had an insertion-allele frequency of 50%; in our phasing analysis 
of this breed, we found 4 dogs homozygous for the increased-fear allele (Fig. 2) and 3 
homozygous for the alternative allele (which reveals a short phased haplotype spanning 8 
SNPs).] Moreover, the existence of insertion-positive and -negative versions of the 
ancestral chrl8 increased-fear/aggression haplotype are also evident from Cocker 
Spaniels and Beagles - both of which have >20% allele frequencies of the increased- 
fear/aggression haplotype (5), but lack the insertion and chondrodysplasia (19). Shar- 
Pei’s and Huskies also have >20% frequencies of the increased-fear/aggression haplotype 
(5, 12), but have negligible or no chondrodysplasia. Thus, there is evidence of recent 
selection for the increased-fear/aggression haplotype in Dachshunds and Yorkshire 
Terriers (large haplotype size); that selection is presumed to be for short legs in the 
former but unknown for the latter. Only the two breeds that carry the chrl8 increased- 
fear/aggression allele at high frequency have Dj signal spanning the peak SNP, and both 
also have the longest S) signal either intersecting or close to the peak SNP. The allele 
protective of fear and aggression has high allele frequencies in all but the two breeds that 
carry the risk allele, and is fixed (>95%) in 22 of the 30 breeds (5 others are >75%). In 
contrast, two sources of wolf sequence data from around the world show the wolf allele 
frequency of the two alleles to be -50%. We propose these patterns are consistent with 
selection during the domestication process. If so, that selected reduced-fear and 
aggression variant is narrowed to a two-SNP minimal overlap interval of phased 
haplotypes. 

In Figure 3 we present a 5Mb chrX region that contains GWA peaks for size, fear 
and aggression, and sociability. As described above, this region has the highest Fst level 
in the dog genome and has strong LD across 2.6-3.7Mb (5, 12). The overall pattern of 
phased haplotypes and D ; and 5, signal suggests the three traits may be distinct. The 
fear/aggression peak overlaps D, and S) signal for many breeds, but that for size and 
sociability do not. However, there is perfect LD between the size and fear/aggression 
peaks in this haplotype in 9 of 11 breeds, and more data is necessary to establish their 
relationship. Human mutations in the top chrX candidate gene for fear/aggression, 

1GSF1, are known to affect human Growth Hormone biology (25), suggesting the two 
dog traits could share all or some genetic variation at this locus; additionally, expression 
of the gene abutting the peak SNP for size, ARGHAP36, is strongly correlated with 
IGSF1 expression (see below). Although all but one of the breeds with the 
fear/aggression-protective allele have perfect LD with the sociability allele, only half of 
the breeds with the sociability allele have the fear/aggression-protective allele in perfect 
LD. All breeds with the reduced-fear/aggression allele show D, signal overlapping the 
fear/aggression peak, but there is only one segment of each overlapping the sociability 
peak. On the other hand, there are eight 5, segments at the fear and aggression peak (one 
is shifted a single SNP away), but none is in a breed with the fear/aggression-protective 
allele. Thus, the predominant signatures of positive selection point to the association with 
fear and aggression. This is also suggested by the phased haplotype analysis, which 



shows minimal overlapping regions of 841 vs. 284 kb for the fear/aggression-protective 
and alternative alleles, respectively. In contrast, the appearance of such an effect at the 
sociability locus is due to the contribution of the 10 breeds that carry the fear/aggression- 
protective allele in perfect LD; and the minimal overlapping regions there are 85 vs. 47 
kb for the social allele and the non-social allele, respectively. Notably, this locus presents 
the possibility of a special case of hitchhiking in which selection initially occurs on one 
functional variant (here reduced-fear/aggression), but a second variant in the region is 
also favored in subsequent selection (here, size or sociability). As a result, relative to the 
original variant selected, the LD breaks down only on the side opposite the second 
selected variant. 

Based on modem worldwide wolf sequences, the minor chrX allele is that 
associated with increased-fear/aggression (43% frequency) (26). Of the diverse 30 breeds 
studied here, 20 have frequencies >70% for the increased-fear/aggression allele and 10 
have such frequencies for the alternative allele (with many breeds fixed for one of those). 
In Discussion we present the considerations for interpreting this. 

Fine-mapping functional variation 

Based on our test cases of known trait-associated variants selected in ancient or recent 
times, we called the minimal intervals for fear and aggression on chrl8 and X. Vaysse et 
al. showed that top ranked S) and D, signals mirror each other in a subset of regions, but 
the effects are highly variable; and interpretation seems complex and lacking fine 
resolution. The furnishings locus on chrl3 is an example of complementing S, and D { 
information (Fig. IB). However, whereas all five breeds with furnishings have D, 
intervals that intersect the peak SNP, three of those lack such evidence for 5,. D, may thus 
be more sensitive and precise for fine mapping in general, but many regions appear to be 
more broadly flagged by both statistics. Based on the findings above, we propose the 
following scheme: i) the maximum map interval corresponds to the region of overlap of 
phased haplotypes across all breeds (separately applied to both SNP alleles at a GW A 
peak); and ii) the minimum map interval can combine that with the smallest region of 
overlap of D, or 5, regions across breeds. Using these criteria, the ancient small-size 
variant in IGF1 is correctly predicted to lie in a maximum map interval of 18 kb, which is 
the same as the minimum map interval since D { or 5, regions do not further delimit the 
region. For the recently selected furnishings-variant in RSP02, it is correctly predicted to 
lie in a maximum map interval of 186 kb and a minimum map interval of 59 kb. In 
accordance with hitchhiking theory, it seems likely that the maximum map interval for a 
region under selection will always capture the functional variant. However, it is not clear 
how precisely that critical region could be narrowed using different signatures of 
selection; each case could be unique and is likely to require additional confirmation. 

For the chrl8 fear and aggression locus, the predicted maximum map interval for 
the most recently selected increased-fear/aggression haplotype is 418 kb and the 
minimum is 108 kb. However, if there was initially selection for the reduced- 
fear/aggression allele at domestication and subsequent selection for the increased- 
fear/aggression allele at the same position (as we speculate based on parsimony), then the 
maximum and minimum map intervals would be 13.2 kb. For the chrX locus, the 



predicted maximum map interval for the reduced-fear/aggression allele is 841 kb, and the 
minimum map interval is 218 kb. The sociability locus lacks significant DJSi evidence 
and has no phenotype for a key breed. The selected sociability allele T thus has maximum 
and minimum map intervals of 147 kb. Notably the original report of this sociability 
GWA, using the same genotype data, was only suggestive in the full dataset; rather, 
genomewide significance was detected in analysis of females exclusively, and yielded a 
peak of 10 SNPs spanning 580 kb. 

Biological relevance of candidate genes 

According to the minimal intervals, we have implicated the following genes: i) in the 
chrl8 fear and aggression locus, GNAT3 and CD36\ ii) in the chrX fear and aggression 
locus, IGSF1, F1RRE (long non coding RNA) and STK26; and iii) in the chrX sociability 
locus, MBNL3 and HS6ST2. Both sociability candidates are expressed in the brain: 
MBNL3 at very low levels and apparently not at all enriched in brain, but HS6ST2 at very 
high levels and highly enriched in many brain regions (BioGPS microarray data for 176 
human and 191 mouse tissues and cells (27); NCBI GEO GSE1133 and GSE10246). We 
favor HS6ST2 for the sociability association. For the fear and aggression loci, we only see 
clear biological relevance (mainly expression in sensory organs, brain or adrenal gland) 
for GNAT3 and IGSF1. 

GNAT3 encodes Gustducin alpha, the G alpha subunit that transduces taste 
receptor signaling. Gustducin alpha also has chemosensory roles in the vomeronasal 
organ, airways and gastrointestinal tract (28, 29). There are also reports that GNAT3 is 
expressed in areas of the brain that include the brainstem, hypothalamus and 
hippocampus (30-32). However, those studies targeted specific brain regions and we are 
not aware of systematic analysis of the entire brain in any mammal. We consulted the 
Allen Brain Atlas of in situ mRNA hybridization analysis (33) and found evidence that 
GNAT3 is most highly expressed in the amygdala in the adult mouse, specifically in layer 
2 of the Cortical Amigdalar Area (“CoA”; Fig. S2). That finding is supported by analysis 
of public gene expression data showing that the highest ranked expression-change of 
Gnat3 mRNA in any brain region is a 3.73-fold increase in the amygdala of rats 6h after 
pain exposure (Nextbio analysis server (34): Gnat3 has a percentile gene-ranking 
score=99 for the microarray experiment, p=0.0165; NCBI GEO Accession GSE1779 
(35)). The next-highest ranked (and significant after multiple testing correction) gene 
expression-changes reported in experimental paradigms involving the brain follow: i) rat 
hippocampus (GSE3531, score=82); ii) mouse cortex (GSE31840, score=81); and iii) 
mouse striatum (GSE48955, score=65). In addition to very high expression in the 
amygdala, the Allen Brain Atlas of the adult mouse reveals lower levels of expression in 
parts of the pons: Lateral reticular nucleus, Paragigantocellular reticular nucleus, lateral 
part, and the Facial motor nucleus. 

IGSF1 is Immunoglobulin superfamily member 1. Common human variation in 
IGSF1 is associated with age at menarche, and loss-of function mutations cause a human 
syndrome of congenital hypothyroidism, macroorchidism, Prolactin and Growth 
Hormone deficiency, delayed pubertal testosterone and obesity (25, 36). It is expressed 
very highly in the anterior pituitary and hypothalamus, but also in the choroid plexus. 



adrenal gland, pancreas, heart and skeletal muscle, fetal liver and testis (57). To generate 
quantitative data, we used the BioGPS Gene Atlas of genome wide gene expression in 191 
mouse tissues and cells. The increased-expression levels, relative to the median for all 
tissues are as follows: pituitary, 1,060-fold; hypothalamus, 340; amygdala, 26; heart, 22; 
spinal cord, 11; hippocampus, 10; placenta, 10; and nucleus accumbens, 5.5 (data is for 
probe 1433652_at). Immunohistochemical evidence clearly shows that IGSF1 protein 
expression is restricted to neurons (including at low levels in the cortex, lateral ventricle, 
and cerebellum (38)). Notably, there is strong evidence that IGSF1 is co-expressed with 
its two flanking genes, ARHGAP36 and the long non-coding RNA FIRRE. In the mouse 
BioGPS GeneAtlas of 191-tissues/cells, expression of 1GSF1 (probe 1433652_at) has 
Pearson correlations of 0.85 with FIRRE (1436638_at; two other FIRRE probes also have 
correlations >0.76) and 0.80 with ARHGAP36 (1454660_at); the same probe has a 
correlation 0.92 with a second IGSF1 probe. The human BioGPS GeneAtlas of 176 
tissues/cells lacks a probe for FIRRE, but IGSF1 and ARHGAP36 have a correlation of 
0.97 (probes 207695_s_at and gnflh04904_at). It is thus possible that variation in the 
IGSFI region could also affect the other two genes. 

Discussion 

Here we demonstrated the utility of combining widely used measures of positive 
selection to fine map GWA loci. This included breed differentiation, reduced variation 
and phased haplotype analysis. The latter was done similarly to how this is done in 
human triads: using a homozygous parent or offspring at any locus to determine the 
phased haplotype for it and, thus, for the heterozygous carrier. Here we analyzed each 
breed separately, and individually determined the extent of homozygosity spanning the 
risk and non-risk GWA alleles in homozygotes of each. One recurring pattern we see is 
that the selected haplotype tends to be larger, as expected. Another observed pattern that 
was expected, within a population (breed), is that the original LD that exists across the 
selected allele in a sweep breaks down over time; but LD will break down slowly 
between the selected allele and variants on one side of the selected site. Assuming the 
variation flanking a selected allele is neutral, then LD blocks (or phased haplotypes) from 
individual breeds will overlap at the middle but extend to one side or the other. 

The phased-haplotype and Z) ; data for small size at the IGF1 locus illustrate 
signatures of ancient selection. We interpret the chrl8 fear/aggression locus to have 
similar phased-haplotype evidence of very old selection for the reduced-fear/aggression 
A allele. In addition there is recent selection of the increased-fear/aggression C allele in 
two breeds. Consistent with hitchhiking theory, the latter is indicated by the large phased- 
haplotypes in two breeds, and the fact that the putative variant under selection is in a 
central position within those haplotypes. In the Daschound, which is fixed for that allele 
in perfect LD with the chondrodysplasia FGF4 retrogene insertion, that selection has 
been presumed to be for short legs. Yorkshire Terriers also have that insertion haplotype 
at high frequency but do not manifest proportionally-short legs - ruling out recent 
selection on that trait. [Other breeds, such as Beagles and Cocker Spaniels have high 
frequencies of the increased-fear allele, but lack the retrogene insertion and shortened 
legs.] While the Yorkshire Terrier was created in the 1800’s for vermin control in mills 



(and was also used for rat baiting), it is currently a very popular “toy” breed. That raises 
the possibility of recent human selection for dogs that prefer to be held closely (i.e., here, 
because they are fearful). Alternatively, both of these breeds could be under selection for 
aggression traits at the chrl8 locus. In contrast to chrl8, the chrX fear and aggression 
locus may be a special case in multiple respects - including that there may be a second 
linked allele under selection (for sociability or size), and that it is in a region with a low 
recombination rate (at least in part because it is on chrX). We also suspect that the 
increased- and decreased-fear/aggression variants at this same locus may both have been 
under selection at different times and in different breeds. 

Most breeds have very high frequencies or are fixed for the chrl8 reduced- 
fear/aggression allele. The phasing analysis of that haplotype is similar to that of the 
selected IGF1 haplotype in small breeds: they are short and thus presumed to be ancient. 
These observations converge to implicate involvement of the chrl8 decreased- 
fear/aggression allele in domestication. This is consistent with the widely held theory that 
dogs were domesticated by virtue of losing fear of humans (39, 40). In contrast to chrl8, 
20 of the diverse breeds studied here have the increased-fear/aggression chrX allele at 
>70% frequency, and only 10 have the reduced-fear allele at that level. Wolves have 57% 
allele frequency for the reduced-fear allele. The ancestral haplotype under selection is 
clearly that for decreased fear/aggression, but its age is difficult to discern because of its 
location on the X chromosome. Thus, despite the pattern of allele frequencies in the 
extant dog breeds and wolves sampled here, the putative advantage of reducing fear and 
aggression in the domestication of a predator leads us to speculate that selection of that 
allele was involved in domestication. The uneven distribution of the allele is consistent 
with geographical effects, gene flow between dogs and wolves, and natural or human 
selection. The latter possibility is hinted at by the few breeds that have long phased- 
haplotypes for the increased-fear/aggression allele, mainly the Weimaraner and 
Greyhound (both were originally hunting breeds). Also, it would not be surprising if wolf 
populations selected for the reduced-fear allele in the last several hundred years as 
interactions with humans became increasingly common. One possible scenario is that the 
ancestral wolf had a high frequency of the increased-fear/aggression allele; the 
alternative, reduced-fear, allele was selected in dog domestication (at least in some 
geographical areas); selection post-domestication favored the reduced fear allele in 
wolves; and natural/human selection or genetic drift resulted in the present patterns of the 
two alleles. The imminent availability of archeological/ancestral dog and wolf sequences 
should resolve this question (41). 

Combining GWA mapping data with different signatures of selection, we fine- 
mapped two new loci for fear and aggression (22) and one previously reported for 
sociability using the same dataset (5). We reduced each locus to two coding genes (one 
locus also has a IncRNA). For sociability, we favor Heparan sulfate 6-O-sulfotransferase 
2 (HS6ST2) because it is both very highly expressed and enriched in brain (the other 
candidate, Muscleblind-like 3 or MBNL3, is very weakly expressed and not enriched in 
brain). Biological relevance is further supported by reports that mice mutant for Expl 
heparan sulfate-synthesizing enzyme or Spock3 heparan sulfate proteoglycan both have 
impairments in social interactions (42, 43). The top candidate for fear and aggression on 



chrl8 is Gustducin alpha ( GNAT3) which is a G alpha subunit that transduces signaling in 
taste cells and other chemosensory systems (including the vomeronasal organ), as well as 
in the brain. Because the fear and aggression traits include non-social fear (e.g., fear of 
loud sounds or novel objects), our working hypothesis is focused more on brain vs. 
chemosensory biology - particularly in the amygdala, hypothalamus, pituitary and 
hippocampus. The top candidate for fear on chrX is the gene Immunoglobulin 
superfamily member 1 ( 1GSF1) which causes a syndrome that includes congenital 
hypothyroidism and Growth Hormone deficiency when mutated in humans. The latter is 
interesting given that large dog size also maps to this same LD block (524 kbp 
downstream from our peak SNP for fear). While the biochemical function of IGSF1 is 
not well understood, it has multiple endocrine roles and is predominantly expressed in the 
pituitary and hypothalamus (also relevant here, it is expressed at much lower levels in the 
amygdala and adrenal gland). Studies of fear-relevant rat models showed that 
GNAT3 has among the highest gene expression changes in the amygdala after a pain 
stimulus (described in Results; 55), as IGSF1 does in the cortex after both stress and 
tactile stimulation (44). In summary, we propose that regulatory variants in the neuronal 
genes GNAT3 and IGSF1 (and possibly the latter’s flanking genes ARHGAP36 and 
FIRRE, as the three genes are co-regulated) affect canine behavior by modulating the 
function of the central fear circuit that transduces sensory information through the 
amygdala to the hypothalamic-pituitary-adrenal axis. 

These findings will require extensive experimental validation at the levels of 
canine genetics, neuroanatomy, physiology and behavior. Because the biochemistry and 
anatomy of fear and aggression are so highly conserved, it will also be possible to study 
these circuits in genetically-modified mice; but one must be mindful of the fact that mice 
are prey and dogs are predators. While the present study levearges the strengths of cross¬ 
breed genetics, it is now important to confirm and more precisely define the behavioral 
effects in breeds carrying both alleles at chrl8 or X. This will lead to a meaningful 
understanding of these traits - including their development (45) and our ability to affect 
them behaviorally or pharmacologically (46, 47). Given the sparcity of knowledge of 
human fear/aggression genetics, these dog findings present rich opportunities for 
translational psychology and psychiatry (48). 


Materials and Methods 
Genotype, phenotype and GWA data 

For the genotype data, we used a previously published SNP dataset (5). The dataset 
contains a total of 456 dogs from 30 breeds genotyped on the -175,000 SNP Illumina 
CanineHD array. GWAS markers were used to define the traits based on breed 
stereotypes for size and furnishings (5), osteosarcoma (49), and fear and aggression (22, 
23). Boyko et al. reported that a size variant lies somewhere in the 3.7Mb chrX region 
with the top-Fsr signal in the dog genome (12). We mapped the peak SNP in this region 
to chrX: 105,245,495 using the the Vaysse genotypes (5) and the same GWA methods and 
significance thresholds as in our companion study (Genome-wide Efficient Mixed Model 
Association algorithm or GEMMA (22, 50)). 



Phased haplotype analysis 

We used the top GWA marker for each trait (or for arbitrarily chosen control regions) to 
segregate individuals within each breed by their carrier status: heterozygous and 
homozygous. Continuing the analysis within breeds, the homozygotes for alleles A and B 
were analyzed further. To construct the largest common phased haplotype within a breed, 
we defined the boundaries by evaluating each SNP upstream and downstream, and 
keeping only SNP markers that have an allele frequency of at least 0.95. The largest such 
interval for each allele at the peak SNP was called a phased-haplotype block. In this work 
we only include within-breed phased-haplotype blocks if they were present in four or 
more dogs. Allele frequency calculations and data analyses were performed in PLINK 
v.1.07 (57). 

Si/Di blocks and gene annotation 

We used S) and D, data previously published by Vaysse et al. using the same genotype 
data (5). This information is available as supplementary material at 
http://dogs.genouest.org/SWEEP.dir/Supplemental.html . We converted all SNP positions 
from the original CanFam2 assembly to CanFam3.1 to take advantage of the Broad 
Institute’s CanFam3 Improved Annotation Data V.l (52). The improved gene annotation 
is available as a public track on the UCSC genome browser. 
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Fig. 1: Mapping phased-haplotypes and S,/D, regions for known canine GWA loci 

(A) . Analysis of small-size IGF1 variation (quantitative trait associated with T allele). 

(B) . Analysis of dog furnishings trait (binary trait associated with T allele). Red line is 
the peak GWA SNP. Large window sizes are 1.26 Mb in A and 1.48 Mb in B. Small 
window sizes are 422.3 kb and 439.9 kb, respectively. Green-white gradient shading 



represents larger-smaller dog body size in A, and presence/absence of dog furnishings in 
B. Diagonal lines indicate missing phenotype data. 


Fig. 2: Mapping phased-haplotypes and 5,/A' regions for fear/aggression traits. 

Quantitative trait with increased-fear/aggression associated with the C allele. Red line is 
the peak SNP for fear/aggression GWA. The large window size corresponds to 1.50 Mb, 
and small to 812.1 kb. Green-white gradient shading represents larger-smaller 
fear/aggression risk. Diagonal lines indicate missing phenotype data. 

Fig. 3: Mapping phased-haplotypes and 5,- /A regions for fear/aggression traits. 

Quantitative trait with decreased-fear/aggression associated with the G allele. Red line is 
the peak SNP for fear/aggression GWA, orange is for sociability peak and purple is for 
size peak. The large window size corresponds to 5.20 Mb, and small to 2.00 Mb. Green- 
white gradient shading represents larger-smaller fear/aggression risk. Diagonal lines 
indicate missing phenotype data. 
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Fig. SI. Phased haplotype analysis for two loci of unknown selection status 
Fig. S2. Mouse Gnat3 is highly expressed in the Amygdala and Piriform area 
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Fig. SI. Phased haplotype analysis for two loci of unknown selection status. (A) Region not 
associated with any known trait. (B) Dog osteosarcoma risk locus (binary trait). Large window 
sizes: 1.60 Mbp and 1.48 Mbp respectively. Small window sizes: 441.2 kb and 424.3 kb 
respectively. Green-white gradient shading represents percent of incidence of dog osteosarcoma 
according to literature. In (A), although the general patterns are unremarkable, the salient feature 
is that the largest phased haplotype (in black) is in the Border Terrier, and that is flanked by 
reduced variation (gray). This suggested the possibility that the highlighted haplotype could be in 
LD with a larger phased haplotype. In this case, the alternative allele is also present in the same 
breed and comparison of the two patterns shows a shared segment of reduced variation, which 
overlaps an S t interval unique to the Border Terrier. We tested this possibility and confirmed that 
there is a 391 kb phased haplotype that overlaps the S L region, and that is fixed in the Border 
Terrier; further investigation is necessary to consider the evolutionary basis for this observation. 
We also briefly considered the possibility that complex diseases that are common in dogs could 
be the result of selection for unrelated traits. We chose the locus associated with osteosarcoma 
risk (B) in Greyhounds because the risk allele is common across breeds and was shown to be 
fixed in two other breeds with high risk of this bone cancer, the Rottweiler and Irish Wolfhound 
[1]. That peak SNP in the CDNK2A/B locus on chrll (1.45 Odds Ratio) intersects S t regions in 
three breeds, two of those have high risk of osteosarcoma and the third does not (another has an S t 
region shifted ~10kb upstream); all four of those S t regions carry the osteosarcoma risk allele with 
the exception of the Newfoundland when both alleles are present. Phasing analysis is 
inconclusive because both the risk and alternative alleles in several breeds have haplotypes in the 
range of 50-100kb. However, it may be of interest that the minimal region of overlap across all 
risk haplotypes is significantly larger than the alternative haplotype despite the former being more 
common across breeds (37 vs. 21kb). This hints at the possibility that the risk allele was under 
selection long ago. The CDKN2A/B/AS1 locus is genetically and biologically very complex, with 
central roles in cell cycle control that affect the balance between sternness and senescence. Those 
coding and non-coding genes are among the most highly mutated in human cancer, but, in their 




































wild type forms, are also critical to diverse other aspects of human health. Based on the broad and 
key biological relevance, it seems likely that there was natural selection for several variants at this 
locus prior to breed creation. Some of those could be under purposeful or incidental human- 
selection in modern breeds (e.g., histiocytic sarcoma in Bernese Mountain Dogs [2]). It is thus 
notable that two breeds exhibit reduced variation in the haplotype phasing analysis (i.e., the top 
length of haplotype homozygosity shown in black, flanked by regions of reduced variation in 
gray), and both have S t signal in the region. 




Fig. S2. Mouse Gnat3 is highly expressed in the Amygdala and Piriform area. Analysis of the 
Allen Brain Atlas [3] of in situ mRNA hybridization data revealed the highest levels of Gnat3 
mRNA expression in the adult mouse to be in the Amygdala and Piriform area (AMY and PYR). 
The top image is Nissl staining and the bottom two are in situ hybridization; signal intensity is 
represented in a blue (low) to red (high) color spectrum (Atlas Gnat3 Image 1 of 19). The 
strongest signal is in layer 2 of the Cortical amygdalar area and of the Piriform area. The closeup 
image at the bottom shows two regions where the most intense color corresponds to the PIR layer 
2 (left area of solid red cells) and CO A layer 2 (right area of solid red cells); these are separated 
by an area of intermediate signal strength (orange and yellow cells). The other signal above and 
perpendicular to AMY/PIR corresponds to the choroid plexus of the lateral ventricle. Another 
brain area with robust signal is the pons, specifically, in the Lateral reticular nucleus (mixture of 






red and orange-yellow cells), Paragigantocellular reticular nucleus, lateral part (yellow and blue 
cells), and the Facial motor nucleus (yellow and blue cells) (see Atlas Images 10 and 13; data not 
shown). There is also weak to moderate signal in the mitral layer of the olfactory bulb. The 
Atlas series has uneven results, with most sections having very low signal. It is thus possible that 
some brain areas with detectable expression were missed in these experiments. However, the 
positive pattern shown in the closeup here is not suggestive of non-specific binding. For support 
of the image shown here, see also Atlas Image 5. 
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Abstract 


Background 

It has been an open question how similar human and canine lung cancers are. This has 
major implications in availability of human treatments for dogs and in establishing transla¬ 
tional models to test new therapies in pet dogs. The prognosis for canine advanced lung 
cancer is poor and new treatments are needed. Heat shock protein 90 (HSP90) is an 
ATPase-dependent molecular chaperone ubiquitously expressed in eukaryotic cells. 

HSP90 is essential for posttranslational conformational maturation and stability of client pro¬ 
teins including protein kinases and transcription factors, many of which are important for the 
proliferation and survival of cancer cells. We investigated the activity of STA-1474, a 
HSP90 inhibitor, in two canine lung cancer cell lines, BACA and CLAC. 


Results 

Comparative genomic hybridization analysis of both cell lines revealed genetic relevance to 
human non-small cell lung cancer. STA-1474 inhibited growth and induced apoptosis of 
both cell lines in a dose- and time-dependent manner. The ICs 50 after 72 h treatment with 
STA-1474 were 0.08 and 0.11 pM for BACA and CLAC, respectively. When grown as 
spheroids, the IC 50 of STA-1474 for BACA cells was approximately two-fold higher than 
when grown as a monolayer (0.348 pM vs. 0.168 pM), whereas CLAC spheroids were rela¬ 
tively drug resistant. Treatment of tumor-stromal fibroblasts with STA-1474 resulted in a 
dose-dependent decrease in their relative cell viability with a low IC 50 of 0.28 pM. 
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HSP90 Inhibition in Canine Lung Cancer 


Conclusions 

Here we first established that lung adenocarcinoma in people and dogs are genetically and 
biochemically similar. STA1474 demonstrated biological activity in both canine lung cancer 
cell lines and tumor-stromal fibroblasts. As significant decreases in relative cell viability can 
be achieved with nanomolar concentrations of STA-1474, investigation into the clinical effi¬ 
cacy of this drug in canine lung cancer patients is warranted. 


Introduction 

Cancer is the leading cause of mortality in dogs [1-3]. Lung cancer has an incidence of 15/ 
100,000 dogs per year [4] and it is generally a disease of older dogs with an approximate age of 
11 years old at the time of diagnosis. The most common histological subtype of canine lung 
cancer is adenocarcinoma, representing 74-77% of cases [5, 6]. Less than one-third of cases 
have localized disease, with 23% having distant metastasis, 13.5% lymph node metastasis and 
34.6% diagnosed with vascular/lymphatic or intrapulmonary spread [7]. Clinical staging is an 
essential requirement for determining prognosis and treatment and surgical excision is the 
most commonly used and effective treatment modality. Dogs with solitary tumors (Tl) have a 
median survival time after surgery of 348 days, while those that have multiple tumors of any 
size (T2) or tumors invading neighboring tissues (T3), have a dismal median survival time of 
only 58 days after a lung lobectomy [8]. 

The lack of good treatment options, other than surgery, has driven us to improve our under¬ 
standing of the molecular basis of this cancer in dogs, in order to enhance the development of 
novel and rational therapies for this disease. Currently selected nonsurgical treatment options 
for dogs with lung cancer are based on clinical acumen, drug—treatment responses noted in 
other adenocarcinoma tumor subtypes, extrapolation of drug responses in people, and a few 
drug toxicity studies that evaluated a very limited number of dogs with advanced lung cancer. 
Therefore, a significant need exists to provide a rationale for treatment selection based on more 
robust evidence from in vitro and in vivo models of canine lung cancer [9]. 

Lung cancer remains the most common cause of cancer-related mortality in people. People 
with advanced disease are treated with medical therapy alone and have a poor prognosis with 
an overall five-year survival less than 15% [10]. Discovery of a spectrum of gene mutations and 
genomic aberrations has led to the use of targeted therapies utilizing a precision medicine 
approach which has been associated with often dramatic, although often short-lived, clinical 
benefit [11,12]. Unfortunately, even in patients treated with first-line targeted therapy, resis¬ 
tance invariably develops, leaving chemotherapy as the cornerstone of subsequent therapy 
[13]. Pet dog translational models represent a major opportunity to better understand and 
treat human cancers, but lung cancer is the most common human cancer yet to be genetically 
dissected in dogs [14]. Because dog breeds are on the order of 100-fold more genetically simple 
than the human or dog populations, they are more powerful for understanding germline- 
genetic, environmental and gene-gene interaction risks [14]. Notably, the availability of state of 
the art human treatments for canine lung cancer is also dependent on this knowledge. 

Heat shock protein 90 (HSP90), a molecular chaperone protein, plays a central role in regu¬ 
lating the folding, stability and function of many proteins that are oncogenic drivers for lung 
cancers. HSP90 is a highly conserved protein that folds newly synthesized proteins into their 
biologically active conformations preventing their aggregation. HSP90 is expressed as a 90 kDa 
protein with two major isoforms (HSP90a and HSP90(3) and plays an essential role in 
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maintaining cellular protein homeostasis. Co-chaperones and client proteins can modify 
HSP90’s mechanism of action [15-17]. Tumor cells express high levels of HSP90, which exists 
in highly activated complexes that are particularly susceptible to binding HSP90 inhibitors 

[18] . Heat-shock proteins promote tumor cell survival, growth and metastasis, even in growth- 
factor deprived conditions, by allowing continued protein translation and cellular proliferation 

[19] . These proteins provide a mechanism whereby cellular stresses experienced by cancer cells 
are either managed or avoided. Many oncogenes, including tyrosine kinases, transcription fac¬ 
tors and cell-cycle regulatory proteins are clients of HSP90, and thus HSP90 is recognized as a 
crucial facilitator of cancer cell survival [20, 21]. 

Pharmacological blockade of HSP90, i.e. HSP90 inhibition, represents an alternative 
approach for therapeutic intervention, and has shown efficacy in both preclinical studies and 
clinical trials in people [22-24]. Geldamycin, a benzoquinone ansamycin antibiotic, binds to 
the nucleotide-binding site of the N-terminal domain of HSP90 preventing ATP binding, 
resulting in HSP90 inhibition. Geldamycin has poor solubility, stability and unacceptable liver 
toxicity in dogs at therapeutic doses therefore, analogues were developed [25]. STA-1474 is a 
highly soluble prodrug of ganetespib, a novel resorcinol-containing compound unrelated to 
geldamycin that binds in the ATP-binding domain at the N-terminus of HSP90 and acts as a 
potent HSP90 inhibitor. A phase I study with STA-1474 in dogs with cancer showed clinical 
activity with low grade gastrointestinal toxicity that was manageable with concomitant medica¬ 
tions [26] . Inhibiting HSP90 in lung cancer is appealing as no resistance mutations have been 
identified, suggesting it represents a relatively stable target for drug treatment. As little is 
known about the efficacy of cytotoxic and small molecule inhibitors in canine lung cancer, the 
purpose of this study was to characterize the activity of currently used chemotherapeutic agents 
and the small molecule inhibitors, torceranib phosphate, crizotinib and STA-1474 and the 
effects of HSP90 inhibition on the mRNA expression of relevant kinases and HSP90 client pro¬ 
teins in two canine lung cancer cell lines. Here we show that STA1474 demonstrated biological 
activity in both canine lung cancer cell lines and tumor-stromal fibroblasts. 

Materials and Methods 

Cell Lines and Reagents 

The BACA cell line was generously provided by Dr. Joseph J. Wakshlag, Cornell University 
College of Veterinary Medicine (Ithaca, NY). The BACA cell line was established from a histo¬ 
logically confirmed canine primary lung adenocarcinoma. Immunostaining of the cell line was 
positive for cytokeratin indicating epithelial origin [27]. The CLAC cell line was purchased 
through an approved materials transfer agreement with the Japan Health Sciences Founda¬ 
tions, JCRB Cell Bank (Osaka, Japan) [28]. Both cell lines were maintained in high-glucose 
Dulbecco modified Eagle medium (DMEM) with GlutaMax (Invitrogen, Carlsbad, CA) and 
supplemented with 10% heat-inactivated fetal bovine serum (FBS), and a penicillin- (100 I.U./ 
ml) streptomycin (100 pg/ml) solution. Cells were passaged at -90% confluence. In vitro exper¬ 
iments were performed when cells were -90% confluent. STA-1474 was kindly provided by 
Synta Pharmaceuticals® (Lexington, MA). Crizotinib and toceranib were provided by Zoetis™ 
(Groton, CT). Carboplatin (Teva Pharmaceuticals Ltd, Sellersville, PA), gemcitabine (Accord 
Healthcare, Durham, NC) and vinorelbine (Mylan Institutional, Rockford, IL) were purchased 
from The Ohio State University Veterinary Medical Center Pharmacy (Columbus, OH). 

Comparative Genomic Hybridization Array 

We custom designed a 966,903 feature comparative genomic hybridization (CGH) array tiling 
the canine genome (C.E.A and J.L.R, manuscript in preparation; see also [29, 30]. The array is 
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comprised of isothermal 60-nucleotide probes targeting all regions of unique or low copy 
repeats (i.e., with otherwise-unique sequence) based on the CanFam2 assembly (including the 
unmapped contigs annotated as chrUn). Average spacing of probes is 1.9kb for unique 
sequence and 1.2kb for low copy repeats. The CGH platform and probe design method is Agi¬ 
lent SurePrint G3 (it includes 7,113 additional Agilent control probes). DNA quality control, 
array hybridization and scanning were performed by Asuragen® (Austin, TX) under Agilent 
certified conditions. The two tumor cell line samples were compared against a healthy male 
Labrador retriever as the reference. The reference specimen was obtained under informed 
owner consent and the following Ohio State University IACUC approved protocol 
(2010A0015-R1, Canine Specimen Collection and Banking) which covered the procedure used 
to obtain the sample and their subsequent use for research application. Agilent uses a linear 
normalization process (including dye based normalization using copy-neutral normalization 
probes) for their LogR values. This data was imported into Golden Helix SNP and Variation 
Suite, and converted from LogRIO to LogR2. Sample quality metrics were performed, including 
percentile based Winsorizing, derivative log ratio spread, and wave detection/correction. For 
segmentation, we used the univariate Optimal Copy Number Analysis Module (CNAM in 
Golden Helix), which uses a change-point identification algorithm. While this powerful algo¬ 
rithm accurately identifies changes in sequential data, it is computationally intensive. We 
selected 10 max segments per 10,000 bases, 20-marker minimum for a copy number call, and a 
max pairwise permuted p -value of 0.005 (with 2000 permutations per pair). CNV calls were 
based on a logR2 ratio threshold of -0.40/0.40 for losses and gains, respectively (Excel work¬ 
sheet #1, SI Table). Because this segmentation algorithm is more sensitive to deletion copy 
number changes, we used a more stringent threshold for deletions for further analysis (-0.45 
for deletions and 0.4 for gains). Mean symmetric smoothing was applied to all figures. 


RNA Isolation and Reverse-Transcriptase Polymerase Chain Reaction 

Total RNA was isolated from both cell lines using the Absolutely RNA Miniprep Kit (Agilent 
Technologies, La Jolla, CA), according to manufacturer instructions. The RNA quantity and 
quality was assessed with a NanoVue Spectrophotometer (GE Healthcare, Piscataway, NJ). 
TaqMan® Reverse Transcription Kits (Applied Biosystems, Carlsbad, CA) were used to make 
cDNA for RT-PCR analysis of all gene transcripts. Primer sequences used for the RT-PCR 
reactions are listed in Table 1. Additional primer pair sequences used for RT-PCR reactions are 
referenced in Mariotti et al. [31]. The amplified cDNA products were separated according to 
size using gel electrophoresis. Amplicons were resolved on a 1% agarose gel to visualize the 
products. NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 1 (NDUFA1) was used as 
a housekeeping gene [32]. 


Sequencing and Sequence Alignment 

Standard PCR was used to generate high fidelity Taq polymerase-amplified PCR products. The 
resolved PCR products were extracted from the gel, purified using QIAquick PCR Purification 
Kit (Qiagen, Germantown, MD) and sequenced using BigDye™ Terminator Cycle Sequencing 
chemistry (Applied Biosystems, Carlsbad, CA). Automated Sanger capillary sequencing reac¬ 
tions for were run on a 3730 DNA Analyzer. Sequence alignments for the genes HSP70 , 
HSP90AA1 , HSP90AB1 , HSP90B1 , MET , NDUFA1 and NKX2-1 were made to the reference 
sequences NCBI: [Canis lupus familiaris (dog)] Gene ID: 403612, 480438, 474919, 404019, 
403438, 481033, and 403940 updated on 7-Dec-2014 and 29-Jan-2015) using the ClustalW 
procedure in DNASTAR Software Lasergene MegAlign® v.12.1 Madison, WI. All other primer 


PLOS ONE | DOI:10.1371/journal.pone.0142007 November 11,2015 


4/25 




®PLOS 

• • 

ONE 

HSP90 Inhibition in Canine Lung Cancer 

Table 1. Canine RT-PCR Primers. 

Gene 

Forward 

Reverse 

Product size (bp) 

AKT 

5'-TGCTTAAGAAGGACCCCAAGC-3' 

5'-GCTGGTCCAGTTCGAGGGA-3' 

253 

ALK 

5'-CTGTATCGGGGTGAGTCTGC-3' 

5'-CAGGGCCTGGACAGGTTAAG-3' 

245 

c-KIT 

5'-GATGGCCCCTGAGAGCATTT-3' 

5'-GCCTTTTCAGGGGATCAGCA-3' 

248 

c-MET 

5'-GAGGAATGTCCCACTGGAGC-3' 

5'-TGCTGTCCCTCGACCATTTG-3' 

297 

EGFR 

5'-TGGTCCTGGGGAATTTGGAA-3' 

5'-GGTTATTGCTGAAGCGCACA-3' 

289 

ErbB2/HER2 

5'-CCCGAGACCCACCTGGATA-3' 

5'-CAGGGCGTAGTTGTCCTCAA-3' 

228 

HSP70 

5'-AGCTGGAGCAGGTGTGTAAC-3' 

5'-GGGGAAGAAGTCCTAATCCACC-3' 

146 

HSP90AA1 

5'-ACCGAACTGGCTGAAGACAA-3' 

5'-GATCACTTCCAGGCCATGCT-3' 

285 

HSP90AB1 

5'-TGTCCGCCGTGTGTTTATCA-3' 

5'-GCGCCGGTTAGTGGAATCTT-3' 

280 

HSP90B1 

5'-TGAAACTGTTGAGGAGCCCA-3' 

5'-GTGACTTCCCCTTCAGCAGT-3' 

288 

NDUFA1 

5'-AATATTATAAATGGGAGGCGCGG-3' 

5'-TAGTGAACCTGTGGATGTGCG-3' 

168 

NKX2-1/TTF-1 

5'-GACGTGAGCAAGAACATGGC-3' 

5'-CAGATTTTGACCTGCGTGGG-3' 

298 

PDGFRa 

5'-CGACTCCGTTCTCAGTGTCT-3' 

5'-CTGTTCCGCATGGTGTCCT-3' 

223 

PDGFRP 

5'-CCACGCCTCTGACGAGATTT-3' 

5'-CTGCACAGCAGTGTACAGGA-3 

259 

RET 

5'-AGCAGGATACAC GAC CGTT G-3' 

5'-GTCCCGATGGACAAGCTTCA-3' 

384 


doi: 10.1371 /journal .pone.0142007.t001 


sets used to generate PCR products were previously sequenced and aligned to verify the ampli- 
cons [31] 

Cell Proliferation Assay 

To assess relative cell proliferation, cells were seeded in 96-well plates in 100 pi of DMEM sup¬ 
plemented with 10% FBS and incubated overnight. 2.5 x 10 3 and 4 x 10 3 cells per well were 
seeded for CLAC and BACA, respectively in order to achieve ~ 90% confluency. Plates were 
then treated with increasing concentrations of gemcitabine, vinorelbine, carboplatin, crizotinib, 
toceranib phosphate or STA-1474 and were evaluated after 72 h, using the CyQUANT® cell 
proliferation assay according to the manufacturer instructions (Molecular Probes, Eugene, 

OR). For each drug and concentration, six wells were used. Briefly, 72 h after treatment, media 
was removed by gently inverting the plates and the plates were frozen at -80°C. The following 
day, plates were thawed at room temperature (RT) and 200 pi of CyQUANT® GR dye/cell- 
lysis buffer was added to each well. Plates were incubated at RT for 5 min, protected from light 
and then fluorescence measurements were made using a plate reader (Molecular Devices, Sun¬ 
nyvale, CA), with excitation at 485 nm and emission detection at 530 nm. Relative cell number 
was calculated as a percentage of the control wells: absorbance of sample/absorbance of DMSO 
treated cells x 100.All proliferation experiments were repeated three times. 

Similarly, 4 x 10 3 cells of BACA and CLAC were seeded per well and incubated overnight 
before a 72 h treatment with increasing concentrations of VER155008 and STA-1474. Cell pro¬ 
liferation was evaluated with CyQUANT s cell proliferation assay as described above. The drug 
concentrations were selected based on predetermined half maximal concentration 50% (IC 50 ) 
values for each drug. For the VER155008 and STA-1474 studies, fixed constant ratio drug com¬ 
binations from 0.0625 to 16X, where X is the IC 50 were evaluated using a minimum of 9 data 
points which were each repeated in triplicate. The combinations were evaluated for synergism, 
additive effects or antagonism by median-effect analysis (CompuSyn Software v. 1, Inc, 
Paramus, NJ) [33]. The nature of the combinatorial interactions was evaluated using the combi¬ 
nation index (Cl) method. Briefly, cytotoxic effects of the drug combination are described by 
the equation f a /f u = [D/D m ] m where f a is the fraction of cells affected, f u is the fraction of cells 
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not affected (1-/J, D is the dose of drug, D m is the dose of drug to cause the median effect and m 
is the slope of the median-effect curve. The Cl value definitions are listed in the Table 2. 

Detection of Apoptosis 

Alexa Fluor 488 annexin V and 1 pi of PI solution (100 pg/ml) were used to stain cells for fluo¬ 
rescence-activated cell sorting (FACS). After 15 min of incubation at RT, 400 pi of annexin¬ 
binding buffer was added and the sample was gently mixed and kept on ice until analysis. Cells 
were analyzed within 30 min of staining. Caspase 3/7 activity was evaluated with the Senso- 
Lyte® Homogeneous AMC Caspase 3/7 Assay Kit (AnaSpec, Fremont, CA) according to the 
manufacturer’s instructions. Briefly, 4,000 cells per well were seeded in 100 pi of medium and 
incubated overnight. The next morning, cells were treated with STA-1474 (0.05-1 pM) for 24 
and 48 h. Then, 50 pi of the caspase 3/7 substrate solution was added to each well and mixed in 
a plate shaker for 60 min at 150 rpm and wrapped in foil to protect from direct light. Fluores¬ 
cence intensity was measured using a plate reader (Molecular Devices), with excitation at 245 
nm and emission detection at 442 nm. 

Protein Isolation 

For the measurement of HSP90 and HSP70, client proteins and phosphorylated forms in the 
BACA and CLAC treated cells, cells were grown to 90% confluence in 100-mm dishes, placed 
on ice, rinsed with ice-cold DPBS, and lysed with IX cell lysis buffer (#9803, Cell Signaling 
Technology®, Danvers, MA) with ImM phenylmethanesulfonyl fluoride and protease inhibi¬ 
tors (Halt Protease Inhibitor Cocktail Kit; Pierce, Rockford, IL) added just before use. Cells 
were scraped from the dishes, and the lysates were incubated in the buffer for 15 min on ice, 
and centrifuged for 20 min at 16,000 xg at 4°C. The supernatants were collected, and protein 
concentration was determined by a modified Bradford method (Bio-Rad Laboratories, Inc, 
Hercules, CA). 

Immunoblot Analyses 

For immunoblot studies, BACA and CLAC cells were seeded in triplicate at a density of 3.5 x 10 6 
cells per 100 mm plate and incubated overnight. All immunoblot analyses represent protein expres¬ 
sion after 72 h of treatment. Cells were treated with 0.05, 0.25, 0.75, and 1 pM of toceranib phos¬ 
phate and 0.005, 0.05,0.5 and 1 pM of STA-1474 and were incubated for 72 h. For immunoblot 
analysis, 3X Laemmli sample buffer with 1 mM (3-mercaptoethanol was added to 40 pg of protein 


Table 2. Combination index (Cl) value ranges with verbal descriptors. 


Cl 

Description 

<0.10 

Very Strong Synergism 

0.10-0.30 

Strong Synergism 

0.30-0.70 

Synergism 

0.70-0.85 

Moderate Synergism 

0.85-0.90 

Slight Synergism 

0.90-1.10 

Nearly Additive 

1.10-1.20 

Slight Antagonism 

1.20-1.45 

Moderate Antagonism 

1.45-3.30 

Antagonism 

3.30-10.0 

Strong Antagonism 

>10.0 

Very Strong Antagonism 


doi:10.1371/journal.pone.0142007.t002 
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extract (35-200 pg) at a final concentration of 33%, and the samples were heated at 100°C for 5 
min. Cell protein extracts were fractioned on a precast 12% NuPAGE® Novex® Bis-Tris polyacryl¬ 
amide Mini gels (10 cm x 10 cm) using NuPAGE® MES SDS running buffer for small proteins (2- 
200 kDa) or NuPAGE® MOPS SDS running buffer (Life Technologies, Grand Island, NY) for 
medium-size proteins (14-200 kDa) followed by electrophoretic transfer to nitrocellulose mem¬ 
branes (Pall Life Sciences, Ann Arbor, MI). The membranes were incubated overnight at 4°C with 
0.2% Tween-20 in TBS and 2% bovine serum albumin with rabbit monoclonal and polyclonal anti¬ 
bodies. The blots were incubated with secondary anti-rabbit IgG horseradish peroxidase—linked 
antibody in 0.2% Tween-20 in TBS and 2% nonfat dry milk for 1 h at RT. Primary antibodies were 
epidermal growth factor receptor (EGFR, Santa Cruz Biotechnology, Inc., #sc-03; 1:500) [34], 
STAT3 (antibody #610189, BD Transduction Biosciences, Franklin Lakes, NJ, 1:2000), HSP90 
(ADI-SPA-835,1:600), HSP70/HSP72 (ADI-SPA-810, Enzo Life Sciences, Inc, Farmingdale, NY, 
1:2000) insulin growth factor I (IGF-IR(3) receptor beta (#3027,1:500), HER2 (#4290S, 1:1000), 
phospho-AKT (#4060P, 1:2000), mTOR (#2972S, 1:1000), phospho-mTOR (#2971S, 1:1000), 
MAPK (#4695P, 1:1000), phospho-MAPK (4060P, 1:1000), phospho-STAT3 (#9134P, 1:250) 
phospho-S6 ribosomal protein (#221 IS 1:1000), S6 ribosomal protein (#2217S, 1:1000), (3-actin 
(#4970S, 1:1000), GAPDH (#5174P, Cell Signaling Technologies®, Danvers, MA, 1:1000)[35]. 

Two wells of each pre-cast gel were loaded with markers, one for protein electrophoresis transfer 
from gel to membrane marker (#10748-010, BenchMark™ Pre-stained protein ladder, Life Technol¬ 
ogies) and a protein standard marker (#LC5602, MagicMark™ XP Standard, Life Technologies). 

Spheroid and Fibroblast Proliferation Assays 

Self-assembled clusters of cell colonies cultured in a microenvironment where cell-cell interac¬ 
tions dominate over cell-substrate interactions were generated in the form of 3-D spheroids. 
Monolayer cultures were plated at the same time for comparison of treatment effects. In both 
cases, 5,000 cells per well were seeded and allowed to grow for three days. For the spheroid 
growth, ultra-low attachment plates (Corning® Costar® Ultra-low attachment multiwall 
plates, Sigma-Aldrich, St. Louis, MO) were used. For the monolayer model, cells were plated in 
96-well black flat-clear bottom plates (Greiner Bio-One GmbH, Frickenhausen, Germany). 

The wells that formed the perimeter of the 96-well plate (two outer rows) were filled with PBS 
to minimize edge effect. After 72 h of growth, cells were treated with DMSO or 0.005-10 pM of 
STA-1474 for 72 h. Then, viability was assessed with the CellTiter-Glo® Luminescent Cell Via¬ 
bility Assay (Promega, Madison, WI) according to manufacturer instructions. Briefly, spher¬ 
oids were disrupted and mixed by repeated pipetting, aspirated and transferred into a well of a 
black, flat-clear bottomed multiwell plate in 100 pi of media. Next, 100 pi of CellTiter-Glo® 
reagent was added to each well. Contents were mixed for 2 min using a shaker to induce lysis 
and plates were incubated at RT for 10 min before luminescence was recorded using a plate 
reader (Molecular Devices). Finally, luminescence was normalized to the control group and 
ICs 50 were calculated for each cell line and growth model. 

For the fibroblast assay, 1.2 x 10 4 cells were seeded per well in 500 pi of medium, incubated 
for 24 h and then treated with STA-1474 (0.001-1 pM) for 72 h. Viability was assessed with the 
CyQUANT® Assay (Molecular Probes) according to the manufacturer instructions. Fluores¬ 
cence was measured using a plate reader (Molecular Devices), with excitation at 485 nm and 
emission detection at 530 nm and results were normalized to the control group. 

Statistical Analyses 

Experiments were performed three times and the data presented as mean values ± SD. Statisti¬ 
cal analysis of significance was performed using the One-Way ANOVA followed by Bonferroni 
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test. For non-normal distribution, a Kruskal-Wallis test followed by Dunns test was used. IC 50 
calculations were made using a logarithmic regression curve with Prism® 5 for Mac OS X 
(GraphPad Software, Inc. La Jolla, CA). P -values <0.05 were considered statistically significant. 

Results 

Structural variation in the BACA and CLAC cell lines authenticates their 
relevance as a comparative oncogenomic model for human NSCLC 

Comparative genomic hybridization was conducted on BACA [27] and CLAC [28] to establish 
the genes affected by genomic alterations or Copy Number Alterations (CNAs; Fig 1A) [14, 29, 
30]. Stringent thresholds (0.4 and 0.45 Log2 ratios) and high minimum-number of probes per 
CNA segment (20 probes) were applied. All CNAs are provided in SI Table, segregated accord¬ 
ing to cell line and size (focal, defined as <3Mb, and large). Many large alterations affect 
known cancer driver genes in these cell lines. For example, chrl3, which contains the MYC 
gene that is commonly amplified in human lung adenocarcinoma, has 2-copy gains in both cell 
lines. Both cell lines have 2-copy loss of the most commonly deleted genome segment in 
human lung adenocarcinoma—which contains the genes CDKN2A/B/B-AS1 (one focal, the 
other larger CNA; Fig IB). BACA has large CNA deletions of the lung adenocarcinoma tumor 
suppressor PIK3R1 (<1% of human cases) [36] and the pan cancer tumor suppressor CASP3. 
CLAC has a large CNA gain including NRAS , a known lung adenocarcinoma driver in 0.4% of 
human cases [36]. CLAC also has gain of the lung adenocarcinoma oncogene CCND1 associ¬ 
ated with 4% of human cases. 

Because the large CNAs contain very high numbers of presumptive bystander genes, it is 
not straight forward to evaluate all as potential oncogenic drivers. However, it is possible to 
study focal alterations—here arbitrarily defined as <3Mb—to implicate known cancer genes 
and pathways. Table 3 shows the results of Cancer Gene annotation system for Cancer Geno¬ 
mics (CaGe) analysis of all identified CNAs [37]. The table was also annotated by manual anal¬ 
ysis to consider whether a gene is likely to be a tumor suppressor or oncogene [38]. Using our 
conservative criteria to minimize false positives, the total number of genes affected by focal 
alterations is 263. Of those, 129 genes were called as either cancer drivers (89 genes) or pathway 
genes (40). We determined that at least 28 and 13 of those genes, respectively, are gained or 
lost in the predicted direction to be oncogenic (i.e., gain of oncogenes and loss of tumor sup¬ 
pressors). Two of those genes -CDKN2A (Fig IB) and LRP1B -are reported to be significantly 
mutated in lung adenocarcinoma (but the latter, called only in the earlier study, may have been 
due to the large size of the gene) [36]. Gene Set Enrichment Analysis (GSEA) of all focal CNA 
genes yielded the top match by significance as genes altered in a complex therapeutic model 
(GSEA gene set name Zhang_antiviral_response_to_ribavirin_dn) applied to the human lung 
adenocarcinoma A549 cell line; another top hit was genes down-regulated by stable expression 
of SEMA3B in the human lung adenocarcinoma cell line HI299 (p-values/false detection rate 
^-values of 4.5E-06/3.27E-02 and 2.99E-05/4.65E-02). While those mechanistic studies with 
human cell lines may not be directly relevant to the canine BACA and CLAC cell lines, this 
finding and the implicated driver genes mentioned indicate that the dog lines are relevant to 
human lung adenocarcinoma. 

Cancer Gene Census (CGC), Wellcome Trust Sanger Institute (http://cancer.sanger.ac.uk/ 
census/; Futreal etal. PMID: 14993899); Cancer Gene Index (CGI), National Cancer Institute 
(https://wiki.nci.nih.gov/display/cageneindex/Cancer-i-Gene-i-Index-i-End-i-User 
-i-Documentation); bold, genes in the correct direction of gain/loss to be oncogenic or tumor sup¬ 
pressive (according to analysis of pan cancer sequence mutation profile by Davoli et al. PMID: 
24183448); italics and underline, genes that can be either oncogenic or tumor suppressive 
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Fig 1. Comparative genomic hybridization analysis of canine BACA and CLAC lung adenocarcinoma cell lines. A custom 1M feature array was used 
to measure the DNA copy number across the genome relative to germline DNA from a normal dog. The zero line corresponds to a ratio of 1:1 between test 
DNA and a male reference DNA, or generally a copy number of 2. Gains are above the line and losses below. (A) Whole genome copy number analysis is 
shown with BACA on top (red) and CLAC on bottom (blue). Among the important findings, both cell lines have a 2-copy loss of CDKN2A/B/B-AS1 (encoding 
tumor suppressors pi 4-pl 9/ARF and pi 6/INK4; also shown in close-up in B panels) and a 2-copy gain of chrl 3 (which contains the oncogene MYC). Other 
large alterations are a 1 -copy loss of most of chr2 and chrl 6 in BACA and a 1 -copy gain and loss of chrl 7 and chrl 1, respectively in CLAC. Most of all other 
variants shared by the two cell lines represent Copy Number Variation (CNV) in the reference DNA. (B) Copy number analysis of complete chrl 1 shows 
BACA has a 1 -copy gain over 2/3rds of its length and a 1 -copy loss over the remainder and CLAC has a 1 -copy loss over its full length. Both lines have a 
relatively small 2-copy loss (focal in BACA) that overlaps only CDKN2A/B/B-AS1 between the two cell lines. 

doi: 10.1371 /journal .pone.0142007.g001 


(according to Davoli et al PMID: 24183448); asterisk, genes among 26 significantly mutated 
genes in human lung adenocarcinoma (FDR <0.1; according to Ding et al PMID: 18948947) 


cDNA transcripts are present for NKX2-1, HSP70 and HSP90 isoforms, 
and HSP90 client proteins 

Reverse transcriptase-polymerase chain reaction (RT-PCR) was used to identify the presence 
of mRNA for NKX2-1, also known as TTF-1, a marker of lung differentiation, and eleven 


Table 3. Gene annotation summary of focal copy number alterations in BACA and CLAC. 


Annotation 

Category 

Genes 

CGI Genes 

TSC2, CDK6, CDKN2A*, SMO, CHEK2, BCL6, LPP, NF2, RANBP17, CRTC3 

CGI (not CGC) 

ARHGDIG, CARD8, CTSC, DUSP10, IRF5, MAGI2, MAPK8IP3, NRF1, PEX1, 
RAB11FIP3, RPS27L, TUSC3, UBE2E3, UBE2H, UBE2I, AHCYL2, CYP51A1, 
KAT2B, MAD2L1, TNFRSF1B, AXIN1, NME3, PARK2, HMGCS2, SMS, ATP6V1F, 
IL1RAP, FLNC, SDC1, SGOL1, CALL), RAB5A, APOB, SSTR5, CACNA1H, 
IMPDH1, IQGAP1, CLDN16, LEP, CCL20, CDKN2B, MPG, NTHL1, CLDN1, DMD, 
RHOB, API B1, TRIO, B3GALNT1, GALNT3, ARHGAP27, ATAD3B, BAIAP3, 
CCDC136, DNAH5, DYNLRB2, FAM190A, GFER, KRIT1, LRP1B*, LRRC4, 
MRPL28, MSLN, MSR1, NLRP5, OCA2, OSR1, PKD1, PKP4, PNN, RASL10A, 
RHBDD3, RHBDF1, SCN9A, SMC4, SOX8, SSU72, TP63, ZFHX3 

CGC & CGI Pathway 

ARHGAP15, ARHGAP24, DECR2, DIAPH2, GDF7, GNPTG, HERC2, LAPTM4A, 
NEFH, PIWIL2, SEC23A, SLC9A3R2, SPHKAP, STUB1, IGFALS, PPM1L, PIGQ, 
IL12A, RPL38, RPL3L, GEMIN2, MASP1, GNG13, OPN1SW, PPP3CC, POLR3D, 
CTNNA3, NDUFB10 

CGI (not CGC) 
Pathway 

SLC39A14, HAGHL, SLC19A3, KYNU, OR14C36, OR14I1, OR2G6, OR2T11, 
OR2T27, OR2T6, OR51 FI, SLC39A14 


doi :10.1371 /journal.pone.0142007.t003 
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receptor and cytoplasmic tyrosine and serine/threonine kinases, in the BACA and CLAC 
canine lung cancer lines. Both cell lines expressed NKX2-1 (Fig 2A). Ten of eleven HSP90 cli¬ 
ent kinase cDNA transcripts investigated for expression were present in both cell lines. EGFR, 
c-Kit, HER2, VEGFR, PDGFRa/p, c-MET, MAPK, c-RET, AKT-2 were present and transcripts 
for ALK were absent (Fig 2A). Both cell lines had transcripts of HSP70 and the HSP90 isoforms 
(Fig 2B). 

STA-1474 inhibits BACA and CLAC cell line proliferation 

Cell viability was assessed after 72 h of treatment with increasing concentrations of drugs com¬ 
monly used to treat lung cancer in humans and dogs (Fig 3A and 3B). IC 50 values were deter¬ 
mined for each drug in both cell lines. Effects on cell proliferation were generally dose- and cell 
line-dependent. With respect to currently used cytotoxic chemotherapeutics, treatment of the 
BACA line with vinorelbine achieved the lowest IC 50 (0.729 pM). In contrast, when the CLAC 
line was treated with increasing concentrations of vinorelbine for 72 h, an IC 50 was never 
reached, i.e. this cell line was relatively resistant (Fig 3A). Treatment of the CLAC line with car- 
boplatin did not result in a dose-dependent decrease in cell viability until a concentration of 
100 pM was reached. Increasing doses of carboplatin did result in decreasing BACA cell viabil¬ 
ity. Gemcitabine ICs 50 for the BACA and CLAC lines were lower than those achieved with car¬ 
boplatin. Statistically significant decreases in BACA cell viability at 72 h were present for all 
concentrations of gemcitabine used. The IC 50 for gemcitabine-treated CLAC cells was approxi¬ 
mately three times greater than that of BACA. Overall, BACA cell viability was more sensitive 
to these cytotoxic drugs as indicated by the lower ICs 50 compared to the CLAC line. 

Of the three different small molecule inhibitors evaluated, STA-1474, the HSP90 inhibitor, 
achieved the greatest inhibition of cell viability and had the lowest ICs 50 after 72 h of drug 
exposure (Fig 3B). BACA cell viability was minimally affected by torceranib phosphate treat¬ 
ment. A concentration-dependent decrease in CLAC viability was found with increasing con¬ 
centrations of toceranib resulting in a lower IC 50 (0.47pM) when compared to BACA. 
Crizotinib treatment of the CLAC cell line did not produce a statistically significant decrease in 
viability which resulted in an unachievable IC 50 . Although critzotinib treatment of the BACA 
cell line produced a significant (***p<0.001) decrease in viability at both the 1.4 pM and 
1.6 pM concentrations, the cell viability was higher than 50% with the doses used. 

STA-1474 promotes apoptosis in a time- and dose-dependent manner 

To determine if the growth inhibitory effect of STA-1474 on both cell lines was associated with 
apoptosis, cell lines were treated with increasing concentrations of STA-1474 (0.005-0.05 pM) 
for 24 h and evaluated for annexin V and propidium iodide (PI) staining. Although incubation 
of the BACA line with 0.05 pM STA-1474 for 24 h increased the proportion of cells undergoing 
apoptosis (represented by both annexin V and annexin V & PI positivity), this was not statisti¬ 
cally significant (Fig 4A). In contrast, 24 h exposure of the CLAC line with 0.05 pM STA-1474 
resulted in a significant (p <0.05) increase in the proportion of apoptotic cells. Characteriza¬ 
tion of the identified apoptotic response was evaluated further by quantifying the executioner 
caspase activity of caspases 3 and 7 after 24 and 48 h exposures to vehicle (DMSO) or increas¬ 
ing concentrations of STA-1474 (0.05-1 pM). Twenty-four and 48 h treatments of the BACA 
line with STA-1474 resulted in a significant dose-dependent increase in caspase activity (Fig 
4B, middle left panels). A dose-dependent significant increase in caspase 3/7 activity was seen 
in the CLAC line after 48 h of drug exposure (Fig 4B, middle right panels). The relative increase 
in caspase3/7 increased significantly (p<0.001) between 24 and 48 h in the CLAC line but not 
in the BACA line (Fig 4B, lower panels). 
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Fig 2. Characterization of BACA and CLAC canine lung cancer lines by RT-PCR. (A) Reverse 
transcriptase cDNA transcripts of NKX2-1 (lung cancer cell marker) and HSP90 client proteins in BACA and 
CLAC cell lines. NDUFA1 serves as a loading control. (B) Reverse transcriptase cDNA transcripts of HSP70 
and three HSP90 isoforms in BACA and CLAC cell lines. NDUFA1 serves as a loading control. 


doi: 10.1371 /journal, pone.0142007.g002 


STA-1474 decreases expression of signal transduction proteins and up- 
regulates HSP70 expression in canine lung cancer cells in a dose- 
dependent manner 

To further characterize the in vitro activity of STA-1474 in comparison to torceranib phos¬ 
phate, we assessed the ability of these compounds to deplete critical client proteins of HSP90 
and multiple receptor tyrosine kinase targets of torceranib. We also evaluated the ability of 
these compounds to extinguish downstream signaling of the PI3K/mTOR/S6 and RAF/MEK/ 
ERK pathways and the transcription factor, STAT3. Treatment of the BACA cell line with bio¬ 
logically relevant concentrations of STA-1474 induced client-protein depletion of HER2 (start¬ 
ing at 0.05 pmol/L) and STAT3 (at 1.0 pM). Degradation of the phosphorylated forms of the 
downstream signaling proteins, pAKT, pMAPK, pS6, and transcription factor, pSTAT3, 
occurred after exposure to 1.0 pM in the BACA cell line (Fig 5A); whereas in the CLAC cell 
line, decreased phosphorylated forms of the same proteins occurred at just 0.5 pM (Fig 5B). 
Interestingly, protein levels of pmTOR were unaffected by exposure to biologically achievable 
doses of STA-1471 in the BACA line and only slightly affected the CLAC line. Toceranib treat¬ 
ment of the BACA cell line resulted in a slight decrease in HER2. Degradation of all other pro¬ 
teins and phosphorylated forms were unaffected at exposures <10 pmol/L (Fig 5A and 5B). 

As inhibition of HSP90 typically leads to the increased expression of other HSP family 
members that can be used as surrogates for HSP90 inhibition, we evaluated the effect of HSP90 
abrogation on HSP70 expression. Treatment of both cell lines with increasing concentrations 
of STA-1474 (0.005-1.0 pM) for 48 h resulted in an increase in HSP70 expression at 0.5 and 
1.0 pM exposures (Fig 5C, left panels). As expected, treatment of both cell lines with increasing 
concentration of toceranib did not increase HSP70 expression (Fig 5C, right panels). 
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Fig 3. Viability assays and half-maximal inhibitory concentrations (ICs 50 ) for canine lung cancer cell 
lines. Cells were treated with increasing concentrations cytotoxic drugs (A) or small molecules inhibitors (B) 
and proliferation was evaluated after 72 h. Treatment effects were normalized to the drug vehicle-treated 
control group. Each graph shows mean±SEM. ICs 50 were calculated for each experiment. The dotted line in 
the y-axis represents the 50% relative viability. *, p<0.05; **, p<0.01; ***, p<0.001. 

doi:10.1371/journal, pone.0142007.g003 


Synergistic cytotoxicity from dual inhibition of HSP70 and HSP90 is dose 
and cell-line dependent 

The binding of a client protein to HSP90 requires the cooperation of HSP90 with other chaper¬ 
one proteins, HSP70 and HSP40. As induction of HSP70 was present in both cell lines after 72 
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Fig 4. Evaluation of apoptosis in canine lung cancer cell lines treated with STA-1474. Apoptosis was assessed by annexin V/PI staining flow cytometry 
and detection of caspase 3/7 enzymatic activity. (A) Cells were treated with dimethyl sulfoxyide (DMSO, control) or 0.005-1 pM of STA-1474 for 24 h. 

Staining with annexin V and the vital dye, propidium iodide (PI), were used to evaluate early and late apoptosis. Cells that are considered viable are both 
annexin V and PI negative, while cells that are in early apoptosis are annexin V positive and PI negative, and cells that are in late apoptosis or already dead 
are both annexin V and PI positive. (B) Both cell lines were treated as above and evaluated for executioner caspase-mediated apoptosis. Activated caspases 
3 and 7 were assessed 24 and 48 h after treatment. Experiments were performed in triplicate and repeated three times. Each graph shows mean ± SEM. *, 
p<0.05; **, p<0.01; ***, p<0.001. 


doi: 10.1371 /journal .pone.0142007.g004 
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Fig 5. Protein expression of HSP90-regulated proteins, HSP70 and HSP90 in canine lung cancer cell 
lines after treatment with small molecule inhibitors. A set of three plates for each cell line was used for 
evaluation of total and phosphoproteins of downstream signaling pathways (A) Representative immunoblots 
of HSP90 client protein expression from BACA whole cell protein lysates after treatment with STA-1474 or 
toceranib phosphate. (B) Representative immunoblots of HSP90 client protein expression from CLAC whole 
cell protein lysates after treatment with STA-1474 or toceranib phosphate. Controls were cell lines treated 
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with the drug solvent, DMSO, as represented by the 0 concentration. Evaluation of phosphoprotein forms of 
the proteins are indicated by “p”. Drug concentrations are pmol/L. The (3-actin Western blots serve as loading 
controls. (C) Immunobloting from whole cell protein lysates of HSP70 and HSP90 of BACA and CLAC lines 
treated with DMSO (control), STA-1474 and toceranib phosphate. 

doi:10.1371/journal.pone.0142007.g005 

h treatment with STA-1474, we sought to determine if synergistic cytotoxicity would occur 
when cells were treated with the combination of a HSP90 inhibitor and a HSP70 inhibitor. Cell 
line viability was assessed after 72 h of treatment with increasing doses of the HSP70 inhibitor, 
VER155008 (0-30 pM). There was a dose-dependent reduction in viability of both cell lines 
after treatment with VER155008 without a change in the expression of HSP70 or HSP90 in 
either cell line (Fig 6A). Treatment of BACA and CLAC cells for 72 h with a combination of 
VER155008 and STA-1474 inhibitors in concentrations that were constant ratios of multiples 
of the ICs 50 reduced relative cell viability in a dose-dependent manner (Fig 6B). Dose-effect 
and associated Cl values for the drug combination treatment (VER155008/STA-1474) at con¬ 
stant ratios are presented in Fig 6C. The model is most accurate at the/ a = 0.50, the point that 
the drugs affect 50% of the cells. BACA cells treated with VER155008/STA-1474 combinations 
were synergistic at f a = 0.50, and with increasing two-fold IC 50 concentration combinations, 
the Cl values increased to produce moderate synergism at effect levels > 50%. The dose reduc¬ 
tion index (DRI) was calculated and is a measure of how many folds the dose of each drug in a 
synergistic combination may be reduced at a given effect level when compared with the doses 
of each drug alone. The BACA line first showed evidence of synergism to the drug combination 
at f a = 0.50 with a Cl of 0.70. The DRI at/ fl = 0.50 was 1.16 and 3.90 fold for VER155008 and 
STA-1474, respectively. The VER155008/STA-1474 treatment in the CLAC line provided syn¬ 
ergism at the/ a = 0.50, and progressed to antagonistic at effect levels > 90%. The DRI at/ a = 
0.50 was 1.5 and 10.5 fold for VER155008 and STA-1474, respectively. Unfortunately, the 
VER155008 concentration needed to decrease cell viability was too high to take advantage of 
the drug combination in terms of obtaining the desirable therapeutic effect by administering 
low doses of the drugs in combination. These results demonstrate that VER155008/STA-1474 
combination has a limited dose range in which synergism is obtained and the concentrations 
are cell line dependent. 

BACA and CLAC cells were plated in DMEM media and were exposed to the compounds 
for 72 h at constant fixed ratios of [75:1](VER155008:STA-1474 CLAC) and [169:1] 
(VER155008:STA-1474 BACA) and relative cell viability assessed. The resulting Cl values are 
shown for effect level. For example, the doses of these two drugs needed to achieve a 97% 
decrease in BACA relative viability gives a Cl value that would indicate moderate synergism 
which provides a drug reduction index score indicative of a four-fold decrease of VER155008 
and two-fold decrease of STA-1474. 

Relative cell viability after treatment with STA-1474 differs in monolayer 
cultures vs. tumor spheroids and STA-1474 decreases tumor-stromal 
fibroblast viability 

To determine the efficacy of STA-1474 on canine lung cancer cells in different model systems, 
we evaluated the STA-1474 ICs 50 for cells grown as monolayers or as three-dimensional tumor 
spheroids. Both BACA and CLAC cells formed tumor spheroids when grown for three days in 
ultra-low attachment well plates (Fig 7A). BACA cells formed compact, tight spheroids, 
whereas CLAC cells formed loose spheroids. Different STA-1474 ICs 50 were obtained depend¬ 
ing on the model used. Monolayer cultures were seeded and then allowed to grow for 72 h 
before treatment to mimic the growth period needed for the formation of spheroids. BACA 
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Fig 6. Relative cell viability assays, half-maximal inhibitory concentrations (ICs 50 ) after treatment with an HSP70 inhibitor (VER155008) or a 
combination of STA-1474 and VER155008. (A) Cells were treated with HSP70 inhibitor, (VER155008) for 72 h. The relative viability and ICs 50 were 
determined for both cell lines after treatment. (B) BACA or CLAC cells were plated in DMEM media for 24 h and then treated for 72 h with VER155008 only 
(gray squares), STA-1474 only (black circles) or a combination of VER155008 and STA-1474 concentrations (red triangles) ranging from 0.0625X to 16X 
their IC 50 concentrations. Each graph shows mean ± SEM. Each group was compared to the DMSO control. The dotted line in the y-axis represents the 50% 
relative viability. *, p<0.05; **, p<0.01; ***, p<0.001. (C) Multi-drug combination dose-effect analysis for the doublet combination of VER155008 and STA- 
1474 on BACA and CLAC cell lines as measured by the combination index (Cl). The Cl value definitions are represented in the description column. 


doi: 10.1371 /journal .pone.0142007.g006 


cells grown as a monolayer, demonstrated a dose-dependent decrease in viability with an IC 50 
of 0.168 pM and <5% of cells viable at drug concentrations of 0.5 pM or higher. When grown 
as spheres, BACA cells were less sensitive to STA-1474 drug treatment, resulting in a doubling 
of the IC 50 (0.348 pM) compared to monolayer cultures. CLAC cells grown as a monolayer had 
a dose-dependent decrease in viability that was significant at doses of 0.5 pM or higher. CLAC 
spheroid cultures were resistant to all drug treatment concentrations investigated (up to 
10 pM; Fig 7B). We also evaluated the relative cell viability of tumor-stromal fibroblasts after 
72 h of treatment with STA-1474. There was a dose-dependent decrease in cell viability that 
was significant with concentrations of > 0.1 pM. The determined IC 50 was higher than for 
both canine lung cancer cells lines (Fig 7C vs. Fig 3B). 


Discussion 

The prognosis for dogs with advanced lung cancer remains poor and new treatment options 
are needed. The molecular characterization of canine lung cancer is limited compared to 
human lung cancer. In this study we wanted to investigate in canine lung cancer cells the activ¬ 
ity of some of the molecularly targeted therapies used to treat lung cancer in people, as well as 
investigate other chemotherapy drugs used to treat lung cancer in dogs. The biologic activity of 
these different chemotherapies and small molecule inhibitors on canine lung cancer cell lines 
has not been reported previously and could provide guidance and insight into the choice of 
drugs for treating dogs with lung cancer. 

Recent reports of HSP90 inhibitor activity in human non-small cell lung cancer (NSCLC) 
phase Ilb/III clinical trials have provided compelling rationale for investigating the feasibility 
of using HSP90 inhibitors for treatment of lung cancer in dogs [24]. HSP90 is a highly con¬ 
served protein that folds newly synthesized proteins into their biologically active conforma¬ 
tions preventing aggregation. HSP90 also maintains cellular protein homeostasis by acting as a 
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Fig 7. Viability and half-maximal inhibitory concentration (ICs 50 ) of STA-1474 treated canine lung 
cancer cell-line derived tumor spheroids, monolayers and tumor-stromal fibroblasts. (A) Tumor 
spheroids derived from both cell lines were allowed to form for 72 h after seeding cells in the ultra-low 
attachment wells. Images taken of one field of view/well on an inverted microscope at 40X magnification, 
scale bar represents 200 pM (B) Immediately after formation, tumor spheroids were treated with DMSO or 
increasing concentrations of STA-1474 for 72 h. Tumor spheroids and monolayers from each cell line were 
grown and treated identically. Cultured cells were treated with STA-1474 for an additional 72 h and ICs 50 
were calculated. (C) Tumor-stromal fibroblasts were seeded, allowed to form a monolayer for 24 h then 
incubated with STA-1474 for 72 h and viability determined. Experiments were performed in four replicates 
and repeated twice. Each graph shows mean ± SEM and each group was compared to DMSO. The dotted 
line in the y-axis represents the 50% relative viability. *, p<0.05; **, p<0.01; ***, p<0.001. 


doi:10.1371/journal.pone.0142007.g007 
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molecular chaperone with its action modulated by co-chaperones and client proteins [15-17]. 
As HSP90 regulates multiple signaling cascades, the effects of pharmacological blockade of 
HSP90 should interfere with a variety of client proteins and biochemical pathways. Given that 
we know very little about the current crucial signaling pathways for canine lung tumor viability 
and considering we could target multiple signaling proteins with HSP90 inhibition, this 
prompted us to evaluate the pro-drug, STA-1474, in these canine lung cancer cell lines. Our 
initial experiments were designed to characterize the suitability of our cell lines to serve as 
appropriate in vitro models that would potentially respond to therapeutic intervention by inhi¬ 
bition of selected targets. 

Somatic mutations are the predominant mechanism that gives rise to cancer. The average 
cancer cell has approximately four sequence mutations of oncogenes (mean 1) and tumor sup¬ 
pressors (3), 11 very large CNAs involving whole chromosomes (2 gain, 2 loss) or chromosome 
arms (3 gain, 5 loss), and 23 focal CNAs (11 gains, 12 losses) [38]. Here we have determined the 
CNAs for BACA and CLAC using high resolution array CGH. The most striking findings were 
2-copy gains of chrl3, which contains MYC, and 2-copy loss of a small part of chrl 1, for which 
the overlapping segment between the two cell lines includes only CDKN2A/B/B-AS1. These are 
among the most common CNAs seen in human lung adenocarcinoma [36], with CDKN2A 
being involved in 43% of cases. Notably CDKN2A encodes ARF, which directly interacts with 
overexpressed MYC protein to block its transformation and proliferation activities [39]. Other 
large CNA genes gained or lost in the correct direction to drive cancer are PIK3R1 and CASP3 
in BACA, and NRAS and CCND1 in CLAC [36]. Table 2 shows the many cancer driver and 
pathway genes affected in the cell lines by focal CNAs, including TSC2, NF2, BCL6, CHEK2 , 
CDK6 , KAT2B , PKD1 and TP63. Among the mechanistic insights, the CNAs suggest relevance 
for HSP90 inhibition that would be expected to have therapeutic effects through the PI3K and 
MAPK pathways (e.g., PIK3R1 , TSC2, BCL6 , NF2, PKD1 , and NRAS). Other pathways that are 
likely to be affected according to our findings are cell cycle progression (e.g., CDKN2A and 
CCND1) and apoptosis ( CASP3 ). The array CGH data thus support both lung adenocarcinoma 
and pan cancer relevance. Additionally, the GSEA analysis of these two canine cell lines 
strongly implicates human lung adenocarcinoma among all other cancer data. These facts for¬ 
mally demonstrate that BACA and CLAC can serve as comparative oncogenomic models for 
development of drug treatments to mammalian lung adenocarcinomas in which a set of com¬ 
mon driver mechanisms are present. 

Once we confirmed that the cell lines were appropriate in vitro models to evaluate candidate 
drugs that have been effective in controlling human NSCLC, it was important to establish that 
these canine lung cancer cell lines expressed HSP90 isoforms, HSP70, and various receptor 
tyrosine kinases and downstream kinases, relevant to lung cancer in humans. These included 
EGFR, c-Kit, HER2, VEGFR2, PDGFRoc, PDGFR(3, c-Met, MAPK, c-Ret and Akt. The cell lines 
used for this work were from primary canine lung tumors and of pulmonary cell differentiation 
as the presence of cDNA transcripts for TTF-1, a nuclear protein [40]. TTF-1, also known as 
NKX2-1, is a homebox-containing transcription factor essential for the development of the 
lung, and its use as a marker of lung adenocarcinoma, has been recommended by the newer 
classifications of human NSCLC expressed in follicular cells of the thyroid gland and pneumo- 
cytes was present. TTF-1 has a specificity of 100% and sensitivity of 85% in canine primary 
lung cancer [10, 11, 41]. Both cell lines had cDNA transcripts for HSP70, HSP90, and all the 
client proteins and other kinases investigated except ALK. 

When complete surgical resection of a primary lung tumor is not possible in canine patients, 
treatment with cytotoxic chemotherapy may be considered in an attempted to slow the pro¬ 
gression of the disease. Interestingly, the cytotoxic drugs used in this study had limited to no 
apparent effect on cell viability in vitro , or only had effects at drug concentrations not thought 
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to be biologically relevant, or achievable, in vivo. For example, the IC 50 of vinorelbine for 
BACA was 0.72 pM, which is 4- to 10-fold higher than in human NSCLC [42] and the CLAC 
cell line was drug-resistant. The responsiveness of our cell lines is consistent with what has 
been reported in the clinical setting, where only two out of seven dogs with macroscopic 
bronchoalveolar carcinoma had a partial response to vinorelbine treatment [43]. The obtained 
CLAC IC 50 dose for carboplatin is not tolerable in dogs. The tolerated carboplatin IC 50 has 
been extrapolated from pharmacokinetic data performed in laboratory beagle dogs using C max 
at the recommended dose of 300 mg/m 2 given as an IV bolus [44], An estimated maximum tol¬ 
erated dose of carboplatin is 250 pM [44] . Likewise, a previous pharmacokinetic study evaluat¬ 
ing gemcitabine in dogs found the maximum tolerated dose of 22 mg/kg resulted in a C max of 
20-30 pg/mL which is equivalent to a molarity of 67 to 100 pM [45]. When the carboplatin 
ICs 50 for the lung cancer cell lines (50 and 214 pM for BACA and CLAC, respectively) is com¬ 
pared to ICs 50 for other cancer cell lines of canine origin, they are higher than then those for 
canine mammary gland tumors or canine melanoma (30.5 pM and 6.1 pM, respectively) [46, 
47]. The gemcitabine IC 50 differed between the cell lines, with the CLAC IC 50 being three-fold 
higher than BACA IC 50 . Our IC 50 findings for gemcitabine are similar to the ICs 50 reported for 
canine osteosarcoma cell lines, which ranged from 5.7 to 15.3 pM for the cell lines that had 
dose-dependent decreases in cell proliferation [48]. 

We evaluated the prodrug STA-1474 in lieu of ganetespib because of its greater solubility in 
water which facilitates its use in the dog without untoward side effects. A phase I study evalu¬ 
ated STA-1474 in dogs with solid tumors and reported measurable objective responses for 
malignant mast cell disease, osteosarcoma, melanoma and thyroid carcinoma [26]. Based on 
the favorable evidence of STA-1474 displaying potent activity against both canine lung cancer 
cell lines, we further investigated the effects of HSP90 inhibition in canine lung cancer. STA- 
1474 decreased cell viability at 72 h, induced apoptosis and promoted activation of caspase 3/7 
in a dose- and time-dependent manner in both cell lines. Apoptosis was detected 24 h after 
treatment and caspase 3/7 activity continued to increase during the first 48 h. As many of the 
HSP90 client proteins are needed for cell survival and proliferation [49] use of HSP90 inhibi¬ 
tors induces both cell cycle arrest and apoptosis in cancer cells [50]. Similar results have been 
seen in canine osteosarcoma and mast cell tumor cell lines treated with either ganetespib or 
STA-1474, inducing growth inhibition that was at least in part mediated by caspase 3/7-depen- 
dent apoptosis [51, 52]. 

Ganetespib, a second generation HSP90 inhibitor, effectively and simultaneously destabi¬ 
lizes HSP90 client proteins in NSCLC cells including receptor tyrosine kinases and canonical 
JAK/STAT, PI3K/AKT, MAPK and mTOR signaling. Moreover, ganetespib accumulates in 
tumors relative to normal tissues, with a half-life in the tumor that is 10- to 19-fold longer than 
in normal tissues or plasma [23]. Our experiments show that the prodrug of ganetespib, STA- 
1474, down-regulates signal transduction proteins in a dose-dependent manner. For some pro¬ 
teins (AKT, MAPK, STAT3) only the phosphorylated form is decreased but not the total pro¬ 
tein. Lack of total STAT3 down-regulation has been reported before with STA-1474 treatment 
[51], which is an unexpected finding as STAT3 is a HSP90 client protein. This result can be 
explained when individual protein turnover rate is considered. HSP90 inhibition should lead to 
the rapid degradation of newly synthesized proteins and those with a longer half-life will show 
a slower decrease in loss of protein after the HSP90 inhibition [53]. 

The tyrosine kinase inhibitor toceranib is a multi-targeted kinase inhibitor that effects both 
tumor cell proliferation and tumor angiogenesis. We sought to determine if common tyrosine 
kinases which support lung tumor growth could be inhibited by torceranib. Even after 72 h of 
exposure to the highest dose of torceranib used (1 pM) this compound was unable to abrogate 
the phosphorylated and total protein expression of all the kinases evaluated in both cell lines. 
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Further studies are needed to evaluate if torceranib demonstrates inhibitory activity against 
VEGFR, PDGFR and c-Kit protein expression in canine lung cancer cell lines. 

Exposure of tumor cells to HSP90 inhibitors induces a cellular protective and compensatory 
response which is to increase the expression of other heat shock proteins, notably HSP70. The 
increase of HSP70 expression has been shown to abrogate the extent of cell death [50]. Indeed 
we saw a reciprocal upregulation of HSP70 when both cell lines were treated with increasing 
doses of HSP90 inhibitors. Previous studies have reported an increase in HSP70 protein expres¬ 
sion when canine tumors and cancer cell lines were treated with STA-1474 [26, 51]. 

HSP70 has also been shown to be a driver of oncogenesis therefore therapy using a combi¬ 
nation of a HSP90 inhibitor with a HSP70 inhibitor may provide a wider therapeutic window 
and increase the target-driven therapeutic index. We wanted to investigate if the simultaneous 
inhibition of HSP70 would enhance the growth inhibitory effects of HSP90 treatment. The use 
of VER155008, a HSP70 inhibitor, suppressed cell growth in both cell lines but only at high pM 
concentrations. Similar results have been described in human NSCLC cells treated with this 
drug [54]. Surprisingly when the cell lines were treated with the fixed ratio combinations of 
STA-1474 and VER155008 based on their ICs 50 , the doublet combination effects were different 
not only between cell lines but also in terms of the desired effect level. Moderate synergy was 
seen when the BACA line was treated with the combination at concentrations higher than its 5 
IC 50 values, whereas in the CLAC line this combination created synergism only at the IC 50 
value. This was an unexpected finding, as VER155008 has shown to be synergistic with other 
HSP90 inhibitors in human NSCLC lines [54-56]. Multiple drugs can compete with each other 
for the same transporter, molecular target or have conflicting effects on the cell cycle. Studies 
suggest that cancer cells are sensitive to multiple drugs at a certain drug mixing ratio, and that 
the optimal mixing ratio must be retained in tumor tissues to achieve the maximal drug combi¬ 
nation effect [57-59]. The lack of a constant synergistic effect at all levels with the combination 
therapy may be due to the induction of other protective HSPs or upregulation of alternative 
oncogenic compensatory pathways not examined in this study. 

Evaluation of drug activity using a two-dimensional monolayer of tumor cells poorly mod¬ 
els the disease in vivo. Tumors are three-dimensional (3D) complex tissues composed of neo¬ 
plastic cells, vasculature and tumor stroma. As such, we sought to determine the response of 
STA-1474 in a 3D-model of canine lung cancer as well as in cells of the tumor stroma, repre¬ 
sented by tumor-stromal fibroblasts obtained from a primary canine lung tumor cell culture. 
Both cell lines grew as spheres in ultra-low attachment wells, as described by Vinci et al. [60]. 

In contrast to other studies, the need to add growth factors to the media to foster the formation 
of spheroids was not needed [61]. This was critical for the interpretation of the ICs 50 as addi¬ 
tion of growth factors can enhance cell proliferation and viability making direct comparison of 
spheroid and monolayer ICs 50 impossible. Spheroid conformation was different between the 
cell lines. This finding was similar to previous reports that have reported different cell lines 
from a variety of tumors grow in different spheroid patterns [60, 62]. The spheroid shape may 
influence drug responsiveness. Morphometric analysis of sphere area would be helpful to deter¬ 
mine relative cell viability after drug exposure. However, although the sphere area can be mea¬ 
sured using image analysis software, determination of area can be ambiguous in approximately 
50% of the evaluated cases [62]. The STA-1474 ICs 50 were higher for both spheroid cell cul¬ 
tures than for the monolayer adherent cultures. This was not unexpected as in general chemo¬ 
therapy drugs have increased potency in 2D models compared with spheroid cultures, 
although exceptions do exist with a better response seen in 3D models to some pharmaceuticals 
[62]. The drug effectiveness in limiting spheroid viability is in part due to the ability of the 
agent to diffuse into the sphere. Therefore, although spheroids are a better model to evaluate 
cellular responsiveness to drugs, caution must be used when interpreting the ICs 50 as lack of 
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vascularization responsible for the delivery and distribution of the agent throughout the sphere 
is not present. 

The tumor stroma is composed of cancer-associated fibroblasts (CAFs), cells that are phe- 
notypically and functionally different from their normal counterparts. CAFs can induce cancer 
cell sternness as well as epithelial to mesenchymal transition to promote tumor progression 
[63]. CAFs can also induce therapeutic resistance in NSCLC [64]. Moreover, cytotoxic treat¬ 
ment can increase both CAF percentage and cytokine secretion, especially IL-17A, both of 
which contribute to cancer-initiating cells growth and therapeutic resistance [65]. However, 
stromal fibroblasts also respond to the neoplastic epithelial cells by expressing growth factors 
[66]. Because FGFR3 is a known HSP90 client protein [67] we treated tumor stromal fibro¬ 
blasts with STA-1474. Although the IC 50 for tumor stromal fibroblasts was higher than the cell 
lines it was still within a biologically achievable dose. This result was dissimilar to what has 
been reported with other HSP90 inhibitors, which have decreased CAFs cytokine production 
but not viability [68]. 


Conclusions 

According to our genetic and biochemical analyses, canine lung adenocarcinoma cell lines 
are relevant to the same human cancer. Likewise, the results of the canine aCGH and the vari¬ 
ability in biological response to the chemotherpy agents found in these two cell lines suggests 
that the canine patient would benefit from precision medicine which has significantly 
improved the quality of life of the human lung cancer patient. We showed that treatment 
with STA-1474 decreases BACA and CLAC viability and HSP90 client protein expression at 
biologically relevant doses, by inducing dose- and time-dependent apoptosis. In contrast 
with other small molecules inhibitors, STA-1474 affects several proteins in different cellular 
signaling pathways and decreases viability not only in tumor cells, but also stromal fibro¬ 
blasts. The efficacy of two small molecule inhibitors in canine lung cancer cell lines grown in 
a 3D spheroid format established that ICs 50 will be increased when compared to the ICs 50 
obtained when cells are grown in a monolayer. This finding re-enforces the importance of 
evaluating cellular responses to drugs in a model that more accurately mimics the natural 
tumor environment. Nevertheless, the preclinical activity profile of STA-1474 demonstrated 
in this study provides preliminary evidence that this compound is superior to most of the 
currently available drugs and may offer an effective therapeutic opportunity to manage the 
canine lung cancer patient. 
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Abstract: Breast cancer represents the second most frequent neoplasm in humans and sexually 
intact female dogs after lung and skin cancers, respectively. Many similar features in human 
and dog cancers including, spontaneous development, clinical presentation, tumor heterogeneity, 
disease progression and response to conventional therapies have supported development of this 
comparative model as an alternative to mice. The highly conserved similarities between canine 
and human genomes are also key to this comparative analysis, especially when compared to the 
murine genome. Studies with canine mammary tumor (CMT) models have shown a strong genetic 
correlation with their human counterparts, particularly in terms of altered expression profiles of 
cell cycle regulatory genes, tumor suppressor and oncogenes and also a large group of non-coding 
RNAs or microRNAs (miRNAs). Because CMTs are considered predictive intermediate models for 
human breast cancer, similarities in genetic alterations and cancer predisposition between humans 
and dogs have raised further interest. Many cancer-associated genetic defects critical to mammary 
tumor development and oncogenic determinants of metastasis have been reported and appear to 
be similar in both species. Comparative analysis of deregulated gene sets or cancer signaling 
pathways has shown that a significant proportion of orthologous genes are comparably up- or 
down-regulated in both human and dog breast tumors. Particularly, a group of cell cycle regulators 
called cyclin-dependent kinase inhibitors (CKIs) acting as potent tumor suppressors are frequently 
defective in CMTs. Interestingly, comparative analysis of coding sequences has also shown that these 
genes are highly conserved in mammals in terms of their evolutionary divergence from a common 
ancestor. Moreover, co-deletion and/or homozygous loss of the INK4A/ARF/INK4B (CDKN2A/B) 
locus, encoding three members of the CKI tumor suppressor gene families (pl6/INK4A, pl4ARF 
and pl5/INK4B), in many human and dog cancers including mammary carcinomas, suggested 
their important conserved genetic order and localization in orthologous chromosomal regions. 
miRNAs, as powerful post-transcriptional regulators of most of the cancer-associated genes, have 
not been well evaluated to date in animal cancer models. Comprehensive expression profiles of 
miRNAs in CMTs have revealed their altered regulation showing a strong correlation with those 
found in human breast cancers. These genetic correlations between human and dog mammary 
cancers will greatly advance our understanding of regulatory mechanisms involving many critical 
cancer-associated genes that promote neoplasia and contribute to the promising development of 
future therapeutics. 

Keywords: canine; mammary cancer; oncogenes; tumor suppressor genes 
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1. Introduction 

In the field of human cancer research, there is an intense interest in development of appropriate 
model systems for the advancement of future therapeutic inventions. Companion animals such 
as domesticated dogs (Canis lupus familiaris) are considered excellent preclinical models of cancers 
and other complex human diseases for many reasons, including their easy accessibility and living 
status in diverse cultures [1]. Since they are treated as pet animals, most of the dog population 
shares the same environment, risk factors or disease characteristics with the human population [2,3] 
which provides an added advantage for scientists to investigate cancer etiologies. Additionally, dogs 
represent a more outbred population than inbred laboratory animals providing a genetic diversity 
similar to that observed in humans [4]. 

Canine models address two important issues in cancer research. First, in terms of similarities, 
dogs spontaneously develop cancers in the context of a natural immune system with a clinical 
presentation, tumor genetics and heterogeneity, disease progression and response to conventional 
therapies [5] that better models the complex biology of cancers and their interactions with the immune 
system in human patients than mouse models. The similarities between the dog and human genomes 
have also greatly enhanced comparative genomic analysis. With the advent of the high resolution 
2.4 billion bp canine genome sequence and the identification of nearly all of its genes as clear orthologs 
of known human genes [4], the dog has emerged as a valuable comparative and intermediate model 
for the study of human cancers. The high level of sequence conservation between canine and human 
genomes are key to this comparative analysis especially since 600 Mb of DNA sequence conserved 
between dog and human is missing from the murine genome [1]. Secondly, using dogs as animal 
models may contribute to the development of cancer therapeutics for, not only human and dog, 
but also other species—a promising theme lately coined as "One Medicine" that campaigns under 
a unified scientific platform where discoveries in one species can be translated to others to improve 
health management in all species. Canine tumors with potential relevance for human cancer biology 
include osteosarcoma, mammary carcinoma, lymphoma, melanoma, lung carcinoma, and soft tissue 
sarcomas [6]. 

2. Canine Mammary Tumors (CMTs) 

Mammary tumors are the most common neoplasm in sexually intact female dogs. The severity of 
canine mammary tumors (CMT) can be appreciated from a number of studies that reported increased 
rates of incidence in the dog population globally. Breast cancer represents the second most frequent 
neoplasm in humans and all dogs after lung and skin cancers, respectively, although many reports 
indicate that dogs are two to four times more susceptible to mammary cancers than women in certain 
geographical areas [7-10]. Nearly 50% of all these neoplasms are diagnosed as malignant and more 
than 95% of these malignant CMTs are carcinomas [11,12]. 

Canine mammary carcinomas are biologically heterogeneous neoplasms offering several ways 
to classify such tumors on the basis of histopathological characteristics or expression of molecular 
markers [13]. Despite the appearance of histomorphological variations between human and canine 
breast cancers, due to various prognostic indicators, a number of studies have reported that there 
are significant similarities regarding molecular marker expression, hormone dependency and cancer 
phenotypes [11-14]. It is important to classify breast cancer in order to correlate clinical phenotypes, 
invasion or grade of progression and to develop prognostic markers. The human classification 
of breast cancers based on expression profile of luminal epithelial specific genes and hormone 
receptors including estrogen receptor 1 (ESR1), progesterone receptor (PR) and proto-oncogenes such 
as epidermal growth factor receptors (EGFR/HER2), have also identified similar molecular subtypes 
in CMTs, but unlike human subtypes, these are not routinely investigated for CMTs during clinical 
diagnosis [15,16]. Recently, in more refined studies employing immunohistochemical approaches and 
based on the characteristic expression patterns of ESR1, PR and EGFR (ERBB1/HER1, ERBB2/HER2, 
ERBB3 and ERBB4), human-like breast cancer phenotypes for CMTs have been developed 
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and classified as luminal A, luminal B, HER2 positive and triple negative (basal-like) [17,18]. 
Such standard classification therefore strongly supports canine mammary tumors as valuable 
intermediate models for human breast cancer that should be well-placed for developing diagnostic 
and treatment strategies. 

Because CMTs are considered predictive models for human breast cancer [6], similarities in 
genetic alterations and cancer predisposition between humans and dogs have raised interest even 
further. A large number of studies have demonstrated that CMTs have many similarities in molecular 
and clinical features with human breast cancer. Many genetic/epigenetic/tumor biology traits 
that are most frequently associated with mammary cancer have been identified and comparative 
gene expression analysis has revealed a significant similarity in the canine and human genes 
associated with mammary tumor development [19]. Although CMTs have not yet been classified 
based on surface markers, due to an absence of appropriate antibodies identifying human breast 
cancer subtypes, the expression profile of vital genes involved in cellular proliferation, angiogenesis, 
apoptosis, cell cycle regulation, DNA damage repair, signal transduction, and survival pathways 
firmly correlate to those in human breast cancer [19,20]. These studies characterized CMTs, based 
on genome-wide gene expression changes, comparing to human breast cancer, suggesting that 
mutations and alterations in the cancer genome may promote deregulation of individual genes in 
mammary cancers. 

Comparative analysis of deregulated gene sets or cancer signaling pathways showed that 
a significant proportion of orthologous genes are comparably up- and down-regulated in both human 
and canine breast tumors. Prominent oncogenic pathways and related genes, such as PI3K/AKT, 
KRAS, MAPK, Wnt, (3-catenin, BRCA2, ESR1 and P-cadherin, are commonly up-regulated while 
representative tumor suppressive pathways, such as p53, pl6/INK4A, PTEN and E-cadherin, are 
down-regulated in human and canine breast cancer [19,21-25]. This chapter will discuss the 
comparative aspects of cell cycle regulatory genes, particularly the evolutionary descent, structure, 
genomic localization, biological functions, expression defects and post-transcriptional regulation 
of the CDKN2/INK4 family of cyclin-dependent kinase inhibitors (CKIs) in canine and human 
breast cancers. 

3. Cell Cycle Regulators: A Classic Repertoire of Tumor Suppressors 

From simple eukaryotes such as yeast to higher mammals, the cell cycle serves as a fundamental 
biological process by which cells grow and divide and its regulation is central to cancer promoting 
mechanisms. Cancer causes are complex involving dysregulation of cellular functions and 
dysregulation in micro-environmental signaling as well as being rooted in oncogenic mutations [26]. 
Ultimately, cancers occur due to an alteration in the regulation of cell proliferation. Cell proliferation 
itself is rooted in the cell cycle which is a highly regulated process governed by complex 
mechanisms [27]. The uncontrolled cell proliferation in cancer is also associated with a vicious 
cycle where cells divide through unchecked cell cycle progression with a reduction in sensitivity 
to signals that normally guide cells to adhere, become quiescent, terminally differentiate or die. 
This combination of unregulated proliferation and a failure of balancing suppressor activities is 
hallmark of malignant transformation resulting in neoplasia that can eventually develop the ability 
to spread and migrate throughout the body through metastasis [28]. One such group of genetic 
alterations that contribute to cancer development are often termed hypermorphic mutations that 
largely define oncogenes and result from the mutated versions of normal cellular proto-oncogenes. 
Oncogenic mutations appear to destroy the integrity and modulated control of cell proliferation first 
by altering control of the stimulatory pathways that promote cell growth. They may also promote 
neoplasia by suppressing those pathways normally responsible for modulating and inhibiting 
proliferation or causing exit from the cell cycle entirely. These "loss of function" mutations occur 
in tumor suppressor genes that encode proteins that can negatively regulate cell cycle progression 
but, when mutated, are permissive for cancer development and can promote spontaneous as well as. 
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in some instances, hereditary forms of cancer [29]. Two important examples of such loss-of-function 
mutations affecting cell cycle regulation are mutations in the retinoblastoma (Rb) and pl6/INK4A 
tumor suppressor genes [30,31]. Loss of function of these tumor suppressor gene products results 
in liberation of the E2F transcription factors, associated with S phase promotion, that consequently 
remove control of cell cycle exit during G1 phase resulting in abnormal and continuous cellular 
proliferation [29]. 

A group of inhibitory proteins, called cyclin-dependent kinases inhibitors (CKIs) or CDK 
inhibitors, control cyclin-CDK activity thereby restraining cell cycle progression in response to 
extracellular and intracellular signals [32,33]. The orderly progression of the cell cycle is fine-tuned 
by the genes encoding such negative regulators, or CKIs, and positive regulators including the 
cyclins and CDKs. Dysregulation of these genes can lead to premature and unregulated entry into 
the next phase of the cell cycle leaving the previous phase unchecked, and frequently this occurs 
prior to completion of critical molecular events such as repair of DNA damage or replication errors. 
Such dysfunction frequently triggers genomic instability and neoplastic transformation. Based on 
their structural similarities and specific roles in cell cycle regulation, CKIs are divided into two distinct 
groups: the INK4, or CDKN2, and the Cip/Kip, or CDKN1, families [33]. The first group representing 
the INK4 proteins (Inhibitors of CDK4) are so named because of their ability to specifically inhibit 
the catalytic subunits of CDK4 and CDK6. It has been reported that INK4 proteins compete with 
cyclin D for binding to the CDK4/6 subunit [34,35]. The members of the INK4 protein family that 
share common structural features are pl6/INK4A (and pl4ARF, an alternatively spliced product 
from the same locus), pl5/INK4B, pl8/INK4C and pl9/INK4D (Figure 1) [36-38]. The Cip/Kip 
family (for CDK interacting protein/ Kinase inhibitory protein) consists of three members, including 
p21/Cipl, p27/Kipl and p57/Kip2, all of which share a common inhibitory domain that enables 
them to bind CDK complexes [37,39]. These proteins of the Cip/Kip family have broad specificity 
for binding and inhibiting a number of cyclin-CDK complexes compared to that of INK4 members. 
They also inhibit the activity cyclin D-CDK4 preventing Rb phosphorylation during G1 to S phase 
transition. In addition, they inhibit cyclin A-CDK2 in late G1 phase and cyclin E-CDK2 in early 
S phase (Figure 1) [37]. Therefore, both CKI families are important modulating components of the 
complex network of cell cycle regulatory mechanisms. 

3.1. CDK Inhibitors Form a Repertoire of Tumor Suppressor Proteins 

Many studies stress the fact that CDKs are positive regulators and CKIs are negative regulators of 
cell proliferation based on their distinct inhibitory actions in the eukaryotic cell cycle [38,40]. Besides 
their specific roles in cell cycle regulation, differentiation and development, CKIs are proven or highly 
likely tumor suppressors to have this potential, as mutations in these genes promote malignant 
phenotypes [32,36,37,41]. In some clinical trials, CKI tumor suppressors aggressively promote cancer 
cell growth by inducing p53 function and stability and increasing anti-proliferative activity thereby 
inhibiting cell cycle progression [42]. Among all the CKIs, pl6/INK4A is the founding member and 
was the first classified as a major tumor suppressor gene (only preceded by p53 for many human 
malignancies) because the mutations in the INK4A/ARF locus, and loss of heterozygosity of the 
chromosomal region encoding this gene, have been reported in a wide range of cancers including 
melanomas, leukemias, gliomas, lung, breast and bladder cancers [36,41]. The loss of expression 
of the neighboring pl5/INK4B gene, due to promoter hypermethylation, also occurs in a number 
of leukemias and lymphomas [38,43]. The pl6/INK4A locus has also been found to be frequently 
mutated in canine malignant melanomas, mammary carcinomas and fibrosarcomas [21,44-47]. 
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Figure 1. Cyclin-Dependent Kinase Inhibitors (CKIs) are regulators of the cell cycle. Cell cycle phases, 
major regulatory proteins or protein complexes including cyclins, CDKs, INK4 and Cip/Kip inhibitors 
and their targets are shown. Checkpoint (CP) or major restriction point. 

3.2. Evolutionary History , Genomic Localization and Structure of the INK4A/ARF Locus 

A locus on the short arm of human chromosome 9p21 (a known multiple tumor suppressor 
locus) encodes three products called pl6/INK4A, pl4ARF and pl5/INK4B, all of which regulate 
cell proliferation by inhibiting the cyclin-CDK complex at the G1 to S phase transition in the cell 
cycle [36,41]- The organization of the INK4A/ARF/INK4B (or INK4A/ARF) locus in the mammalian 
genome is highly conserved. Orthologous sequence searches and comparative genomics analysis has 
demonstrated that this locus in human (chromosome 9) is syntenic to that of chimp (chromosome 9), 
dog (chromosome 11), cat (chromosome D4), mouse (chromosome 4) and rat (chromosome 5) and 
this region, encoding several tumor suppressor genes, is highly susceptible to genetic instability and 
mutations in many cancers [36,37,48]. The close similarities between the INK4A and INK4B genes 
and two other members of the INK4 CKI gene family, INK4C and INK4D based on their protein 
sequence, biochemical properties and functions in the cell cycle, suggest that they arose as a result of 
gene duplication during the course of evolution. This is most likely true since a number of studies 
have demonstrated that all four INK4 CKIs share a common structural feature called ankyrin repeats 
that appear to function as a structural scaffold facilitating protein-protein interactions and these four 
CKIs also appear functionally related [36,49,50]. 

The evolutionary history of the INK4A/ARF/INK4B locus suggests that the INK4 genes have 
evolved through tandem gene duplication events. One of the most interesting findings for the 
evolutionary descent of INK4 genes was the complete absence of ARF-like gene products in the 
Japanese puffer fish Fugu rubripes (fugu) and in the zebrafish [37,48] suggesting that pl4ARF was 
introduced into the vertebrate or mammalian genome following INK4 duplication. Three unique 
INK4 genes, representing INK4A or B, INK4C and INKD have been identified in the fugu genome 
(Figure 2). Evolutionarily, pl6/INK4A and pl5/INK4B are products of a local tandem duplication 
while pl8/INK4C and pl9INK4D are present on other chromosomes [37,48]. Cross-species 
comparative analysis suggested that a single common ancestral INK4 gene was present and a series 
of duplication and rearrangement events first gave rise to INK4A/B and INK4C/D-like elements in 
a common vertebrate ancestor and after the divergence of higher vertebrates from tetrapod and fish 
approximately 350 million years ago (MYA) gave rise to the individual INK4 genes in the mammalian 
genome [37]. 
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Figure 2. INK4/CDKN2 family tree. Annotated INK4 proteins from select organisms were aligned 
using Clustal W. The alignment was used to construct a neighbor-joining phylogenetic tree (applying 
complete deletion of gaps and Poisson model rates and patterns; MEGA6). Bootstrap values 
were calculated from 500 repetitions. Similar results were achieved with maximum parsimony 
phylogenetic treeing (not shown). The phylogenetic analyses (see text) demonstrate the high 
similarity and conservation among INK4 proteins as well as their evolutionary descent. The scale bar 
shows the number of substitutions per site. NCBI GI accession numbers of the proteins are given on 
the tree along with the common or abbreviated animal name. Taxonomy abbreviations follow: zfish, 
Danio rerio (zebrafish); fugu, Takifugu rubripes (Japanese puffer fish); Xenopus tropicalis (western clawed 
frog); chick, Gallus gallus (chicken, red junglefowl); opossum, Monodelphis domestica; cow. Bos taurus; 
dog, Canis lupus familiar is; rat, Rattus norvegicus (Norway rat); mouse, Mus musculus (house mouse); 
rhesus, Macaca mulatta (Rhesus macaque); chimp. Pan troglodytes (common chimpanzee); human. 
Homo sapiens. 
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These evolutionary changes placed pl6/INK4A and pl5/INK4B about 30 kb apart in the 
same transcriptional orientation on chromosome 9p21 whereas pl8/INK4C and pl9/INK4D are 
present on human chromosomes lp32 and 19pl3, respectively [36]. Phylogenetic trees based on 
the published amino acid sequences of INK4 proteins indicates their high similarities among groups 
and likely divergence from a common ancestor (Figure 2). This evolutionary relationship suggests 
that the pl6/INK4A and pl5/INK4B from mammals represent a paralogous group that was once 
related to pl6/15 in fugu or zebrafish while pl8/INKC and pl9/INK4D are more closely related to 
corresponding orthologs in fugu. Figure 2 suggests that the complement of INK4 genes arose before 
the marsupial-placental mammal divergence. 

During the evolution of pl6/INK4A and pl5/INK4B through gene duplication, an additional 
exon appeared when comparing the two genes. This alternative exon is designated exon 1(3 which is 
alternatively spliced to exon 2 and 3 of pl6INK4A making the novel pl4ARF transcript (Figure 3) [37]. 
Previously, it was postulated that exon 1(3 was the original exon 1 of the INK4A locus but later 
it was determined that this alternative exon was transcribed from its own separate promoter and 
not from the promoter of pl6/INK4A exon la [51]. The presence of such a separate promoter for 
pl4ARF suggests that its transcription is regulated independently of pl6/INK4A. Gene duplication, 
rearrangement and deletion appear to have resulted in a duplicated exon 1 (3 located in the intergenic 
region between the INK4A and B genes that later diverged from each other [37,48]. Interestingly, 
the highly cancer resistant naked mole rat has recently been shown to have an unusual fusion of the 
p 15/pi6 tumor suppressor genes (PMID: 25550505). This may represent a further gene rearrangement 
whose relationship to relative cancer resistance and unusually long life in this rodent remain to 
be investigated. 

3.3. Roles of INK4A/ARF Encoded Regulators in the Cell Cycle and Cancer 

The existence of pl6/INK4A protein was first discovered as a binding partner of cyclin 
D-dependent CDK4 by the co-immunoprecipitation assay. In cells transformed by SV40 virus, CDK4 
was found to be predominantly associated with pl6 rather than cyclin D unraveling an important 
function of this founder member of the INK4 family and suggesting that pi6 can directly bind to 
the catalytic CDK4 subunit in the absence of regulatory cyclin D [40]. Other INK4 members (pl5, 
pl8 and pl9) were found to interact with CDK4 and CDK6 by two-hybrid screening. Both in vitro 
and in vivo studies have reported that all of the four INK4 proteins directly bind the kinase subunits 
(CDK4/6) rather than the cyclin subunit (cyclin D) as they act as competitive inhibitors of the 
cyclins [52]. This specific interaction with CDKs distinguishes the INK4 family from the Cip/Kip 
family of CKIs [36]. Because there is no sequence similarity between exon 1(3 of pl4ARF and exon 
la of pi6 and alternative splicing of exon 1(3 to the shared exon 2 allows translation to continue 
from the —1 nucleotide of the open reading frame of pi6, pl4ARF encodes a completely different 
protein compared to pl6. These two proteins also function in distinct biological pathways. Rb is 
a critical substrate for cyclin D-dependent kinases [40,53] and its phosphorylation is required to 
release and activate the E2F transcription factors switching on gene expression involved in the G1 to 
S phase transition [54]. pl6/INK4A and the three other INK4 members prevent Rb phosphorylation 
by inhibiting CDK4/6 binding with cyclin D [34,35]. This cascade pathway in turns leads to E2F 
repression that inhibits the transcription of many genes required for exit from G1 and initiation of 
S phase eventually resulting in growth arrest [37,54]. 
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Figure 3. Alternative splicing results in two different transcripts and protein products from the 
modern INK4A/ARF locus. The exons are shown as boxes and the sequences encoding pl6/INK4A 
are shown as red shading while those encoding the ARF transcript are colored blue. Exon la is spliced 
to INK4A exon 2 and 3 forming the pl6 mature transcript whereas exon 1(3 is alternatively spliced to 
the same exon 2 and 3 generating the mature pl4ARF transcript. The latter produces a different 
protein from pl6 because translation occurs from an alternative reading frame. The sizes of the 
respective human, dog and mouse pl6 and ARF proteins are shown in the bottom panel, (p 14ARF in 
mouse is named pl9ARF due to its increased length but should not be compared to pl9/INK4D). 


On the other hand, p 14ARF is highly unlikely to act as a direct inhibitor of CDK4/6 because 
of its structural differences from other INK4 proteins. A great number of studies, using mouse 
models and human cancer cells, differentiated the functions and regulation of p 14ARF from that 
of pl6. The initial evidence for its anti-proliferative role came from observations that expression 
of p 19ARF (the p 14ARF ortholog in mouse) in embryonic fibroblasts or NIH 3T3 cells induced cell 
cycle arrest but no direct interaction with CDK complexes was detected in immunoprecipitation 
assays [55]. It has been reported that loss of pl9ARF obviates the requirement of p53 inactivation 
to immortalize mouse embryonic fibroblasts and tumors, including melanomas, in vivo [56,57]. 
This understanding was further refined by other studies demonstrating that suppression of oncogenic 
transformation in primary cells by p 19ARF is abrogated when p53 is inactivated by viral oncoproteins 
or dominant p53 mutants [58] implying that pl9ARF functions upstream of the p53 pathway. 
Moreover, some groups reported that pl9ARF can associate with MDM2 (a p53 ubiquitin protein 
ligase) or inhibit the E3-ligase activity of MDM2 to prevent MDM2-induced p53 degradation [57-61] 
suggesting that these proteins—pl9ARF, MDM2 and p53—exist in a common regulatory pathway. 
In addition to p53 stabilization, pl4ARF regulates p53 transactivation activity. p53 normally acts 
as a strong transcriptional activator of p21/Cipl protein [38]. Expression of pl9ARF in primary 
mouse cells expressing functional p53 results in the induction of p21 that plays essential roles in 
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G1 to S phase arrest, apoptosis and tumor growth suppression [57-59]. Investigating mutations 
and gene expression profiles of cell cycle regulatory proteins in many human cancer cell lines and 
primary tumors provided evidence that p53 mutations do not directly correlate with either pi6 
or Rb expression [30] stressing the fact that pl4ARF (in the p53 pathway) and pi6 (in the Rb 
pathway) have distinct or non-overlapping, important biological functions in cell cycle regulation 
and cancers [36,37]. Thus, this single locus capably regulates the key pathways controlling cell 
proliferation—the Rb-dependent and p53-dependent pathways. 



Frequently deleted regions in human chr. 9p21 and orthologous canine chr. 11 

Figure 4. Relative molecular and cytogenetic mapping of the INKA/ARF locus and closely related 
genes with their positions on human and canine chromosome 9 and 11, respectively. The regions 
at human chromosome 9 and canine chromosome 11 that are frequently deleted in cancers are 
completely orthologous to each other. The molecular mapping shows the exact chromosomal position 
of these genes extrapolated from the NCBI map view of each chromosome represented by the current 
human and canine annotation from releases 106 and 103, respectively. The red and blue arrows 
indicate the transcriptional orientation of genes in the human and dog chromosomes, respectively. 
Transcription of genes from the "+ strand" is indicated by down arrows and from the strand" by 
up arrows. (CFA = Canis lupus familiar is; HSA = Homo sapiens; Chr. = Chromosome). 

3.4. Alteration of the INK4A/ARF Locus in Human and Canine Cancers 

There is compelling genetic evidence from numerous cancer studies that pl6/INK4A is 
a critical tumor suppressor gene whose direct inactivation by point mutation, deletion, or promoter 
hypermethylation is observed in nearly one third of human cancers, establishing its loss as one of the 
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most frequent lesions promoting human malignancy [37]. The pl6/INK4A gene was independently 
isolated as a candidate tumor suppressor gene located at human chromosome 9p21, the region which 
is highly conserved across mammals, and was found to be frequently deleted in many human tumors 
and linked to hereditary susceptibility to melanoma [60-62]. The emergence of human chromosome 
9p21 as a site of a major tumor suppressor gene was deduced from extensive cytogenetic and 
loss of heterozygosity (LOH) studies on a wide range of tumors such as leukemias, melanomas, 
gliomas, pancreatic adenocarcinomas, as well as breast, lung and bladder cancers [61,63-68]. LOH of 
chromosome 9p21 that encodes the INK4/ARF locus was also deleted in the study of a neighboring 
gene called methylthioadenosine phosphorylase (MTAP) that also mapped to the same chromosomal 
region [69]. MTAP, a regulatory gene for purine and polyamine biosynthesis, is frequently deleted 
in different malignant cancer cell lines that also have homozygous deletion of pi6 suggesting that 
loss of MTAP in malignant cells is primarily due to linkage between the MTAP and pl6 genes on the 
same chromosomal region and so they were co-deleted [70]. Furthermore, some malignant cells were 
found to have homozygous deletion of pl6 and MTAP but retained an intact pl5 gene. These findings 
of homozygous deletion of pi6 and its neighbor in cancer cells also revealed the gene order on 
chromosome 9p21 starting from the centromeric end which is pl5, (pl4ARF) pl6, MTAP, IFNA and 
IFNB (interferon alpha and beta) (Figure 4) [70]. 

Studies with non-human animal models of cancers have also reported genetic defects in the 
INK4A/ARF locus. The in vivo role of pl6 in tumorigenesis was initially indicated from mapping 
tumor susceptibility alleles in common BALB/c mouse strains. This mouse model is prone to tumor 
development such as plasmacytoma (tumors of the plasma cells) and lung adenocarcinoma in which 
the major genetic determinant responsible for a strong cancer predisposition also mapped to the 
INK4A/ARF locus [71,72]. Mice with targeted deletions of pl6, pl9ARF or both were investigated 
by several groups suggesting that mouse strains with specific inactivation of pl6 or pl9ARF were 
tumor prone but neither genetic loss alone was as severe as those with double knockouts of both 
of these genes [57,73-75]. Mutant mice that were deficient for pl6 and heterozygous for pl9ARF 
spontaneously develop a wide range of tumors including melanoma [95]. Importantly, primary 
melanomas, mammary carcinomas and osteosarcomas from dogs have also been reported to harbor 
frequent defects in pl6/INK4A [21,45,46,76]. Altered expression profiles from pl6/INK4A/ARF 
have been recurrently observed in a number of canine breast cancers, melanomas and other primary 
tumors that highly correlate to lesions in humans and mice [30,37,77]. In fact, altogether, deletions or 
point mutations causing shifting of reading frame and altered expression located mostly in exon 1 oc 
have been found in cancers from humans, dogs and mice suggesting that specific mutation mapping 
in pl6/INK4A and its regulation are not limited to these cancer types (for example, melanomas and 
breast cancers) but also occur in other tumors commonly encountered in mammalian species with 
neoplasms or uncontrolled cellular growth [21,71,72,77,78]. 

Furthermore, the region at canine chromosome 11 (orthologous to human chromosome 9p21) 
encoding INK4A/ARF, MTAP and close neighbors including miR-31, as shown in the comparative 
chromosomal mapping (Figure 4), is also highly susceptible and prone to concomitant deletion in 
many cancers in dogs [16]. Reports from several studies suggested that a haplotype spanning MTAP 
and INK4A/ARF loci showed susceptibility to naturally occurring canine sarcomas [79]. The miR-31, 
one of the highly cited tumor suppressor miRNAs in human breast cancer, was also found to 
be down-regulated and differentially expressed in canine osteosarcoma and mammary tumors, 
respectively [80]. Therefore, the comparative analysis of cytogenetic and molecular mapping of the 
genetic defects at human chromosome 9p21 and the corresponding canine chromosome 11 identified 
frequently deleted regions encoded by the INK4A/ARF/INK4B locus with a highly conserved order 
of genes (Figure 4) that are concurrently lost in many cancers recapitulating the strong similarities in 
genetic alternations and cancer predisposition between humans and dogs. 
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4. Regulatory, Small Non-Coding RNAs: microRNAs in Cancers 

It is increasingly apparent that a significant portion of the mammalian genome (estimated to 
be >70%) encodes regulatory information that is largely carried out by non-coding RNAs [81-83]. 
These non-coding RNAs consist of two major classes: small non-coding RNAs (<200 bp, including 
miRNAs) and long noncoding RNAs or IncRNAs (>200 to -100 kb) [84,85]. IncRNAs share many 
features of mRNAs, but in contrast to mRNAs, they are found within introns of protein coding 
genes or intergenic regions of the genome [83] demonstrating developmental and tissue-specific 
expression patterns [86]. The IncRNAs play a number of important regulatory functions that affect 
epigenetic changes including chromatin remodeling, transcriptional co-activation and repression, 
post-transcriptional modification of mRNAs as well as cellular functions including differentiation 
and homeostasis [84]. Dysregulated expression of IncRNAs causes disruption of these biological 
functions and plays a critical role in cancer development [85]. To date, a number of IncRNAs have 
been implicated in breast cancer development and metastasis. One of the most well-known and first 
identified IncRNAs is a HOX antisense intergenic RNA that is commonly abbreviated as HOTAIR. 
This IncRNA, located in the mammalian HOXC locus, has been demonstrated to be associated with 
polycomb repressive complex 2 that mediates transcriptional repression of numerous genes involved 
in differentiation pathways during development and stem cell pluripotency [87-90]. Importantly, 
HOTAIR has been reported to be highly upregulated in both primary and metastatic breast cancers 
and its overexpression is a strong predictor of metastasis and poor survival [87]. However, unlike 
the rapid advances in miRNA research, including the well-established mechanisms of miRNAs 
in gene silencing and the strong sequence conservation of miRNAs across mammals, knowledge 
regarding the molecular mechanisms of IncRNA function in cancer is still growing. Most IncRNAs 
are poorly conserved, and their mechanisms of action remain unclear and are in need of further 
exploration [83,84]. 

The discovery of microRNAs (miRNAs) established a new era in translational regulation 
research and for understanding post-transcriptional regulation of genes as well as their critical 
regulatory roles in diverse biological processes including cell cycle, cell proliferation, differentiation, 
development and apoptosis as well as in disease pathogenesis [91-93]. miRNAs are evolutionarily 
conserved, endogenous small structural RNA molecules (-22 nucleotides) that post-transcriptionally 
suppress gene expression in a sequence specific manner [94]. Expression of these small structural 
RNAs is tightly regulated during development and in normal mature tissues and is frequently altered 
in cancer [95]. Strikingly, more than 50% of miRNA genes are located in cancer associated genomic 
regions or fragile sites that are also preferential sites for translocation, deletion, amplification, and 
integration of exogenous genome fragments suggesting that miRNAs play an important role in the 
pathogenesis of many human cancers [96,97]. Since miRNAs are encoded by highly conserved and 
naturally occurring genes across mammalian species, evaluation of their expression profiles in cancer 
models shows great promise for advancing the development of future therapeutic reagents, as well 
as for improving diagnostic and prognostic analysis. 

miRNAs and their associated proteins appear to be one of the most abundant biomolecules in 
the cell. Improvements in small nucleotide amplification technologies and in sequence prediction 
algorithms, miRNA discovery from model organisms, as well as non-model species, have greatly 
advanced with 35,828 mature miRNAs in 223 species putatively identified to date (miRBase v20.0). 
Based on this latest estimate, the human, mouse and canine genomes account for 2588, 1915 and 
453 mature miRNAs, respectively, and these numbers reflect those miRNAs identified to date and 
this is anticipated to grow to similar numbers in most mammals compared to the human genome. 
The number of experimentally validated miRNAs from each species is smaller than the predicted 
number. However, both bioinformatics and empirical evidence suggests that more than 30% of 
protein-coding genes in the human genome are subjected to regulation by miRNAs, indicating their 
prominence as global regulators of gene expression [98-101]. 
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In mammals, mature miRNAs generated from sequential processing of primary miRNA 
transcripts by Drosha and Dicer miRNA processing complexes associate with 3'untranslated regions 
(3'UTR) of specific target messenger RNAs (mRNAs) to suppress translation and may also induce 
their degradation [102]. In the nucleus, the RNase Ill-type enzyme Drosha processes the long 
primary transcripts (pri-miRNA that is initially transcribed by RNA polymerase II from the cellular 
genome), yielding 60-70 nucleotide hairpin precursors called pre-miRNA. The resulting pre-miRNA 
hairpins are translocated to the cytoplasm by Exportin-5. In the cytoplasm, the pre-miRNAs are 
further cleaved and processed into 19-25 nucleotide miRNA duplex structures by the RNase Dicer 
and transactivator RNA binding protein (TRBP). The functional strand (or guide strand) of the 
mature miRNA is loaded together with Argonaut (Ago2) proteins into an RNA-induced silencing 
complex (RISC), where it guides RISC to silence target mRNAs through mRNA cleavage, translational 
repression or deadenylation. The passenger strand (the complementary strand of the double stranded 
pre-miRNA following Dicer processing) is typically degraded [102]. The mature miRNAs usually 
target the 3'UTR of mRNAs and make complementary base pairing with their seed (core orthologous 
target) sequences (located at 2-8 bases from the 5'end of the miRNA) [97]. The seed sequence, by 
which miRNAs bind to their targets, is only several nucleotides long, suggesting that each miRNA 
may potentially bind to a large number of genes thereby regulating their expression. miRNAs can 
direct the RISC complex to downregulate target gene expression by either of two post-transcriptional 
mechanisms: mRNA cleavage or translational repression [98,103,104]. The execution of one of these 
mechanisms is primarily determined by the degree of complementarity between the miRNA and 
its target mRNA. The miRNA will promote the cleavage of the target message if its seed region is 
sufficiently complementary to the target sequences [105]. After degradation of the mRNA, the miRNA 
remains intact and can guide the RISC to target other messages. Interestingly, miRNAs can regulate 
their own expression or biosynthesis by targeting the miRNA processing machinery. For example, the 
miR-103/107 family can inhibit DICER expression and induce epithelial to mesenchymal transition 
(EMT) promoting metastasis in human breast cancer [106]. 

4.1. OncomiRs: Cancer Associated miRNAs 

The association of miRNAs with the initiation, progression and key control pathways of 
human malignancies holds great potential for new developments in advanced diagnostic and 
therapeutic strategies in the management of most common cancers. The expression of miRNAs 
are deregulated in cancer by a variety of mechanisms including amplification, deletion, mutation 
or epigenetic silencing [107-109]. Epigenetic regulation of miRNAs is mediated by promoter 
hypermethylation in certain human cancers. For example, miR-127, which is downregulated in 
human cancer cells, has been reported to be located within a CpG island and highly up-regulated 
by DNA demethylation and histone acetylation [109]. Many groups have discovered "miRNA 
signatures" in both hematological and solid tumors that discriminate cancers from normal cells 
and have potential for improving prognosis, management of progression and possibly suppression 
of cancer [97,110-114]. miRNAs are often regarded as "oncomiRs" meaning miRNAs involved in 
dominant cancer regulatory mechanisms. OncomiRs can be categorized as tumor oncogenes and 
tumor suppressors as anti-oncomiRs. miR-155 was one of the first identified oncomiRs that has 
been demonstrated to be highly expressed in several well-known lymphomas, leukemias, breast, 
colon and lung cancers [111,113-116]. Like miR-155, other oncogenic miRNAs usually target tumor 
suppressor genes and cell cycle inhibitors, or other anti-proliferative genes and they can also serve 
as potential therapeutic targets. Another strong oncogenic candidate miRNA is miR-21 which 
is upregulated in a wide variety of blood related and solid tumors including myeloid leukemia, 
lymphocytic leukemia, gliobalstoma and cancers of the pancreas, prostate, stomach, colon, lung, 
liver and breast [110,113,117-119]. Overexpression of miR-21 in these cancers inhibits the apoptotic 
pathway promoting dysregulated cell proliferation. miR-21 was also one of the first miRNAs 
identified in the human genome that showed strong evolutionary conservation across a wide range 
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of vertebrate species. Three major targets of miR-21 include prominent tumor suppressors such as 
PTEN (phosphatase and tensin homolog), an important regulator of cardiovascular disease, PDCD4 
(programmed cell death 4) and TPM1 (tropomyosin 1) [119-122]. 

The let-7 miRNA was one of the first anti-oncomiRs, or tumor suppressor miRNAs, 
characterized, which is highly conserved among mammalian species, and is downregulated in many 
tumors including lung and breast cancers [111,114,123]. The let-7 miRNA family functionally inhibits 
a number of well-characterized oncogenes such as ras, c-myc and HMGA2 and induces apoptosis 
and cell cycle arrest in human colon cancer cells [123-126]. This miRNA targets the ras oncogene in 
lung cancer by being abnormally under-expressed promoting cell cycle progression [123]. In addition, 
let-7 also downregulates the expression of c-myc, a transcriptional activator of many tumor promoting 
genes that are dysregulated in lymphomas. Thus, anti-oncomiRs effectively control the expression of 
many oncogenes and their transcription factors at a post-transcriptional level. 

4.2. Regulation of miRNAs in Human and Canine Breast Cancers 

The association between altered miRNA expression signatures and breast cancer metastasis 
has been described by many studies [127,128]. A large number of miRNAs have been identified 
as deregulated in human breast cancer compared to normal breast tissue. The overexpression of 
certain oncogenic miRNAs (miR-21, miR-27a, miR-155, miR-9, miR-lOb, miR-373/miR-520c, miR-206, 
miR-18a/b, miR-221 /222) and the loss of several tumor suppressor miRNAs (miR-205/200, miR-125a, 
miR-125b, miR-126, miR-17-5p, miR-145, miR-200c, let-7, miR-20b, miR-34a, miR-31, miR-30) lead to 
loss of regulation of vital cellular functions that are involved in breast cancer pathogenesis [127,128]. 
In human breast cancer, miR-21 upregulates the EMT, the PI3K/ATK signaling pathway, the 
anti-apoptotic pathway and induces proliferation by targeting very well-characterized tumors 
suppressors such as PTEN, TPM1, and PDCD4 [121,122,129,130]. Strikingly, all of these miR-21 
targets have been reported to be deregulated in canine mammary tumors as well. In this regard, 
expression of selected miRNAs associated with human breast cancers have been investigated in 
canine malignant mammary tumors. Almost all of the canine miRNAs in CMTs followed the same 
expression profile observed in human breast cancers when compared to normal canine mammary 
tissue. This investigation revealed that miR-21 and miR-29b were significantly up-regulated and 
miR-15b, miR-16 were significantly down-regulated in breast cancers in both species [131]. 

4.3. miRNAs Regulate Cell Cycle by Targeting Multiple Genes 

An important function of miRNAs is to regulate cell cycle progression and arrest by targeting 
multiple cell cycle regulatory genes. These miRNAs regulate cell proliferation by specifically 
targeting cyclin-CDK complexes and CDK inhibitors. One of the first discoveries that connected 
miRNAs and cell cycle regulation was the anti-proliferative potential of the miR-15a/16-l family 
that target multiple cell cycle genes involved in cellular proliferation and growth arrest [132-135]. 
The miR-16 family act as tumor suppressors that induce cell cycle arrest at the G1 phase by targeting 
several cyclin-CDK genes including CDK6, cyclin Dl, cyclin D3, E2F3 and WEE1 and all the miRNAs 
in this family are downregulated in a wide variety of tumors [136]. Additionally, miR-34 and other 
family members, target CDK4/6, cyclin Dl, cyclin E2, E2F1/3 and c-myc, indicating their strong 
anti-proliferative roles [137]. These miRNAs are transcriptionally activated by p53 and are involved 
in the p53 signaling pathway thereby acting as mediators of tumor growth suppression [138]. 
However, the tumor suppressive miRNAs involved in cell cycle regulation are inactivated in tumors 
by epigenetic mechanisms, such as hypermethylation, leading to overexpression of their target 
genes [139]. For example, members of the miR-290 family positively regulate G1 to S phase 
transition by inhibiting cyclin-dependent kinase inhibitors such as p21, during embryonic stem cell 
differentiation [140]. The Cip/Kip family CKIs are targeted by miR-17-92, miR-106b, the miR-221 
family and miR-25 in many different carcinomas [136]. Expression of pl6/INK4A is repressed by 
miR-24 and miR-31 which are also involved in the regulation of cell proliferation and progression 
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of cell cycle in many cancers [141,142]. It has been reported that miR-21 negatively regulates cell 
cycle during G1 to S phase transition in response to DNA damage and inhibits Cdc25A expression 
affecting G2/M progression in colon cancer cells [143]. Another study showed that miR-322/424 
and miR-503 are upregulated during myogenesis and these miRNAs promote cell cycle arrest at 
G1 phase by down-regulating Cdc25A [144]. A recent report revealed that canine miR-141 can 
post-transcriptionally regulate pl6/INK4A and pl4ARF transcripts while groups of differentially 
expressed miRNAs may potentially target the rest of the CKI gene family members as well as 
oncogenes of the cell cycle in canine breast cancer models [145]. All of these reports clearly suggest 
that the cell cycle G1 to S phase transition is tightly regulated by several families of miRNAs. 
Therefore, different miRNAs regulate the cell cycle both positively and negatively by targeting the 
expression of many genes at different stages, and dysregulation of most of these regulatory molecules 
and pathways have been implicated in different pathological or developmental conditions. 

5. Conclusions 

In conclusion, the strong similarities in genome sequence, along with highly similar 
characteristics for spontaneous tumor models, have raised great promise for further comparative 
genomic research between humans and dogs. Comparing spontaneous mammary carcinomas 
in female dogs with breast cancer in women has significantly improved our understanding in 
deciphering the molecular mechanisms, relevant risk factors, and genetic profiles of these types of 
cancer and as well as novel strategies for future therapeutic inventions. However, although there 
is great potential in canine cancer models, a large number and complete interactions of cancer 
associated genes such as the cell cycle regulators, including the INK4 tumor suppressor genes and 
emerging miRNAs in the canine genome, have not been well studied in such models. Additionally, 
the high correlation between tumor suppressor gene expression and miRNA activity imposing 
post-transcriptional regulation is one of the central areas in cancer research which also needs to be 
further explored. 
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Introduction 

Marianne Isabelle Martic-Kehl, Michael FW Festing, Carlos Alvarez, Jannis Wernery & Pius August 
Schubiger 

Animal models in biomedical research 

Modern biomedical research relies heavily on the use of laboratory animals, 
particularly mice, rats and fish which, in the UK in 2013, accounted for 94% percent 
of animals used in research. The research included fundamental studies aimed at 
understanding biological processes, the pre-clinical testing of potential new drugs and 
therapies, the development of diagnostic reagents and, in the case of monoclonal 
antibodies, the production of therapeutic agents themselves. 

In all developed countries the use of animals should (and probably is) strictly 
regulated in order to minimise pain and distress. All research workers should be 
familiar with the “Three Rs”, Replacement, Refinement and Reduction described in 
the book The Principles of Humane Experimental Technique (Russell and Burch, 
1959). So where possible, non-sentient alternatives to the use of animals should be 
used as a “Replacement”. If animals must be used, then “Refinements”, such as 
anaesthesia and analgesia as well as enriched housing conditions should be used to 
minimise pain, distress or lasting harm, and the number of animals used should be 
“Reduced” to the minimum necessary to meet the objectives of the study. 

There are continued, successful, efforts to develop alternatives to the use of animals. 
For example, large numbers of animals were once used for assaying many biological 
reagents such as hormones and vaccines. These have now largely been replaced by in- 
vitro methods such as direct immunological or chemical assays. Fundamental 
research uses large numbers of mice but also makes extensive use of cell cultures and 
tissues from animals, which have been humanely euthanased. An important 
“Refinement” has been the development of disease-free or so called “specific 
pathogen free” mice, rats, guinea-pigs, rabbits and cats and a few other species. These 
are free of clinical and sub-clinical infections which can cause problems if the animals 
are stressed by an experimental treatment. “Reduction” is achieved by good 
experimental design in which neither too many animals are used, which would be 
wasteful, nor too few which might mean that important reactions are missed. 

In vivo research takes an important role in life science, particularly in preclinical drug 
development. The drug development process as it is standard nowadays consists of 
several sub-phases (research phase, pre-clinical development, clinical phases I-III), 
taking several years and give rise to costs between 50 Million and 2 Billion US 
Dollars ( DiMasi, Adams, Light). 



The drug developing process is highly prone to attrition. One critical step is the 
translation of pre-clinical animal research results to the clinic. In the last decade, it 
was frequently revealed in many research fields, that translation rates are minimal, 
non-existent or generally shrinking (Sena 2007, Baker 2014, Arrowsmith). A literature 
investigation by Thomson-Reuters has revealed that the success rates of development 
projects in phase II clinical trials have fallen from 28% to 18% between 2009 and 
2010 (Arrowsmith). In more than half of the cases the reason for attrition identified 
was insufficient efficacy. 

Animals in the drug development process - historic background 

Animals as surrogate organisms for humans were already used in the 19 th century for 
the purpose of understanding chemistry-based drug effects on physiological function. 
A first Pure Food and Drugs Act in the USA (1906) described official standards for 
drugs and proper labeling and prohibited the interstate commerce of unsafe drugs. In 
1938 the Food, Drug, and Cosmetic Act further required proof of safety and 
authorized inspections as a consequence of the sulfanilamide-disaster in 1937 that 
killed over hundred people in the USA. The reason for the toxicity was that the syrup 
with the antibiotic sulfanilamide administered against coughing contained the toxic 
solvent diethylene glycol. The syrup was very popular with children due to the sweet 
taste of the diethylene glycol. 

The first privately financed, nationally supervised and evaluated drug was 
Streptomycin, in 1945. However regulatory authorities and the movement towards 
supervised drug development was still restricted to the Anglo-Saxon world. At that 
time in continental Europe the view persisted that diseases were non-comparable 
processes, not suitable for statistical evaluation “since the treatment never concerns 
populations but only individual patients” (Virchow, 1847). In 1954, the German 
company Grlinenthal patented the sedative thalidomide and launched it 1957 as 
Contergan® in West Germany. Since the company considered the drug particularly 
safe they marketed it as a sleeping pill for pregnant women, as well as against 
morning sickness in early pregnancy. However two years later first reports about 
nerve damage related to Contergan® appeared and 1961 Grlinenthal had to withdraw 
the drug from the market when thousands of babies were born with extremity 
abnormalities. This tragedy known as the thalidomide-disaster, led to the first drug 
law in Germany (1961). The USA revised their existing drug law a year later to 
request proof of efficacy and sufficient pharmacological and toxicological results 
from animals to be granted a license for market authorization. This development 
founded the base for the drug development process, as it is known nowadays in 
developed countries. As up to now, it is not possible to fully replace live animals as 
human surrogates, however it is of high concern to perform such experiments in the 
most ethical way possible. 

(http://www.fda.gov/AboutFDAAVhatWeDo/History/FOrgsHistory/CDER/Centerfor 
DrugEvaluationandResearchBrochureandChronology/ucml 14465.htm; 
http://www.contergan.grunenthal.info/grt-ctg/GRT- 
CTG/Die_Fakten/Chronologie/152700079.j sp; 
http://www.contergan.grunenthal.info/grt-ctg/GRT- 

CTG/Die_Fakten/Das_deutsche_Arzneimittelrecht_nach/152700071.jsp;jsessionid=B 
82E77391EBCF93D7DCD34660297CBF2.drpl; Hildebrandt AG, 2004). 

In various official acts, for example of the Royal Society in the UK, the UK 
Department of Health or the US Department of Public Health, it is stated: “Virtually 



every medical achievement of the last century has depended directly or indirectly on 
research with animals”. Whether or not this statement has any proven evidence was 
unclear until 2008, when Robert Matthews has investigated on it. In his article, he 
came to the conclusion that even though the statement does not generally hold true, 
“animal models can and have provided many crucial insights that have led to major 
advances in medicine and surgery” (.Matthews RAJ, 2008). 

Indeed research using laboratory and domestic animals has underpinned many major 
advances in human medicine. Perhaps Louis Pasteur in the 19 th century should be 
credited with the first use of scientific methods to develop new treatments for 
infectious disease. He used dogs and rabbits to develop methods of immunizing dogs 
and humans against rabies, a virus disease (although viruses were not known at that 
time), and sheep to immunize sheep against anthrax, a bacterial disease. His methods 
laid the ground work for the development of vaccines used now to control diseases 
such a polio, measles, mumps and rubella. Two infectious viral diseases, smallpox 
and rinderpest, a serious disease of cattle, have even been entirely eliminated from the 
wild and polio has nearly been eliminated. The first oncogenic retro virus was 
discovered in 1911 by Peyton Rous who found that cancer can be induced in chickens 
by injecting them with a cell free extract from a chicken tumor. Further studies of the 
biology of murine retro-viruses such as the Bittner mammary tumor virus and murine 
leukemia virus, meant that when HIV appeared, at least the biology of retroviruses 
was largely understood. Although infectious diseases have now largely been 
controlled in developed countries, new zoonotic diseases such as that caused by the 
Ebola virus which is maintained in wild animals in West Africa and various strains of 
the influenza virus present in wild and domestic birds remain a constant threat, in 
view of the rapidity with which diseases can be transmitted throughout the world. 
And antibiotic resistant bacterial remain a constant threat. 

Transplantation of kidneys, hearts and other organs has saved many lives. This was 
made possible by the discovery of immunological tolerance by Peter Medawar in the 
1950s. At that time it was known that skin grafts between two individuals would be 
rejected but it was assumed that this was a physiological problem. Medawar showed 
that reciprocal skin grafts between two different strains of mice are rejected. 
However, if lymphocytes of a donor strain were injected into baby mice of a recipient 
strain, treated adult mice of the recipient strain would then permanently tolerate grafts 
from the donor strain. This showed for the first time that graft rejection is an 
immunological rather than a physiological phenomenon, and that it can be controlled 
by immunological methods. The development of drugs such as cephalosporin to 
dampen the immune system, again using laboratory animals, has made organ 
transplantation possible. 

The first chemotherapy was developed by Paul Ehrlich, who, in 1909 screened 606 
chemicals for activity against the spirochete causing syphilis using rabbits infected 
with the organism, and found one, salvarsan, that was effective. For a time it was the 
most widely prescribed drug in the world. The development of new drugs now 
depends on an understanding of the biology of the disease, the identification of 
possible drug targets and the screening of large numbers of chemicals likely to 
interact with the target using, in vitro and in vivo methods involving research animals. 
Any potential new drugs will be tested in animals for safety and efficacy usually in 
mice, rats and dogs before proceeding to clinical trials. 



The discovery of insulin in the early 1920s has saved many millions of lives. Banting 
and Best ligated the pancreatic duct of dogs and found that cells associated with the 
production of digestive enzymes degenerated, leaving islands of cells. These secreted 
the hormone later designated insulin, and they showed that it could be used to 
maintain diabetic dogs. The biochemist Collip developed methods of purifying it from 
porcine and bovine pancreases, using several thousand rabbits to assay it. Fortunately, 
although these insulins are different from human insulin, they are sufficiently similar 
to be effect in humans. Before then type I diabetes was usually fatal. For many years 
batches of porcine or bovine insulin had to be assayed using mice or rabbits. 
Frederick Sanger sequenced the insulin protein in 1955 and genetically modified 
human insulin is now produced in bacterial cultures and is assayed chemically. 

Antibiotics have probably saved more lives than any other medical intervention. 
Penicillin was discovered by Alexander Fleming in 1928, but he was unable to isolate 
it and verify that it was effective. This was done by Ernst Chain and Howard Florey 
who were able to show that it was both effect and non-toxic in mice. They went on to 
develop a method of producing it on a large scale. Many other antibiotics have been 
discovered since then. For example, Selman Waksman discovered streptomycin in 
research involving mice, guinea-pigs and chickens. 

Nutritional deficiency diseases are fortunately now rare in developed countries, 
though still a problem in some under-developed countries. Frederick Gowland 
Hopkins showed that young rats given diets of purified protein, carbohydrate, 
minerals and fat stopped growing, but when they were given a small amount of milk 
they grew. He postulated that there are substances required in the diet in minute 
amounts, later called vitamins. The vitamin that he discovered was designated 
vitamin A. His work coincided with that of Christiaan Eijkman who was attempting to 
find the cause of beriberi, a disease characterised by loss of feeling in the feet and 
difficulties in breathing. He injected the blood of soldiers, in hospital with beriberi, 
into chickens. But he noticed that some of the chickens which, were fed on scraps of 
the same diet of polished rice as the soldiers, were also getting sick. He showed that 
chickens which received unpolished rice remained healthy. The disease was caused 
by a deficiency of what we now call vitamin B1 (thiamine). Hopkins and Eijkman 
shared the 1929 Nobel Prize for their work. 

The few examples given above demonstrate how, historically, animal research has 
contributed to the development of many areas of medicine. The development of 
monoclonal antibodies is a relatively recent advance which has resulted in a limitless 
supply of highly specific diagnostic reagents as well as many promising new 
therapeutic agents. B-cell multiple myelomas have been known in humans for many 
years, and it was known that they produced monoclonal antibodies, known as Bence- 
Jones proteins. In the late 1960s it was found, (Potter, 1972) that the BALB/c inbred 
strain of mice produced myelomas when injected i.p. with mineral oil. These 
myelomas were immortalised and could be maintained as permanent cell cultures. In 
1975 Kohler and Millstein fused these myeloma cells with spleen cells from mice 
which had been immunised to sheep red blood cells and found that the “hybridomas” 
secreted antibodies to sheep red blood cells. They were able to select out individual 
hybridoma cells, each of which produced a monoclonal antibody. Subsequently, the 
immunoglobulin genes of the mice were replaced by the equivalent human genes 



using genetic engineering so that human rather than murine monoclonal antibodies are 
produced. This avoids any possible problems associated adverse reactions to mouse 
proteins. Monoclonal antibodies (Mabs) are now used to treat several diseases such as 
some forms of cancer, and as anti-inflammatory treatments of diseases like 
rheumatoid arthritis and Crohn’s disease. Many more are being tested in clinical 
trials. Due to their high specificity they are also widely used in the diagnosis of 
disease. 

Other examples where animals have made important contributions include blood 
transfusion, joint replacements, reproduction and in-vitro fertilisation (allowing many 
otherwise infertile couples to have children), heart valve replacement, cancer and 
stroke. Moreover, veterinary medicine and human medicine are converging. Dogs 
and humans get many similar diseases such as cancer, obesity and type II diabetes. 
Dogs also get a number of hereditary diseases, in some cases as a result of many 
generations of selective inbreeding for breed characteristics which are inappropriate 
or incompatible with health. 

Problems with translation of animal data to the clinic 

Despite the impressive examples described above, many articles in scientific and non- 
scientific journals have criticized the quality and reporting of animal research in drug 
development in the last decades. For some diseases, animal models were found to 
have no predictive value for clinical applications. A mouse model developed to 
investigate cystic fibrosis, for example, turned out to show symptoms different from 
human patients, even though the same genetic modification was introduced (. Ameen, 
2000). Another example is the search for Human Immunodeficiency Virus (HIV) 
vaccinations using non-human primates as surrogate organism. Chimpanzees and 
macaques infected with the Simian Immunodeficiency Virus (SIV), a similar virus to 
the HI virus and of which the HI virus is assumed to have developed from, turned out 
to be responsive to various vaccination candidates, whereas none has been translated 
successfully to human patients so far ( Langley, Buckland, Bailey 2008, Bailey 2011). 
Certainly, men are not 70kg mice and it is probably utopic to assume that the efficacy 
of any drug candidate can be 100% reliably predicted by the investigation of an 
animal surrogate, nevertheless, literature analyses in various fields of research have 
revealed a variety of potential causes apart from pure genetics for low predictive 
value of animal research data. Poor experimental planning, inappropriate statistical 
analysis and insufficient reporting are keywords frequently summarized in literature 
{Pound, Bailey 2005, Hackam 2006, Hackam 2007, Knight, Matthews, Langley, 
Muhlhausler, PLOS Medicine editors, Eisen, Baker, Bertotti) and it is well 
conceivable that the predictive value of animal research can be increased substantially 
by eliminating such methodological shortcomings. 

A disease field where a lot of work has been done to tackle down potential reasons for 
translation failures is acute stroke. Literature analysis revealed that almost five 
hundred intervention candidates have shown satisfactory efficacy in animal models, 
whereas only three interventions are proven to be effective in patients suffering from 
acute stroke {O’Collins 2006, Sena 2007). For various interventions with positive 
outcome in animal models, meta-analyses were performed to investigate potential 
reasons for this high failure rate. Judging from a checklist with ten quality criteria, 
researchers found a trend towards overestimation of effect size in studies suffering 
from low quality {Sena 2007). 



By analyzing quality shortcomings of the investigated studies, the investigators 
identified two main groups, which can be summarized as, first, general, stroke- 
independent and second stroke-specific shortcomings. The latter included the use of 
animal models (mostly mice) that did not reflect the general health state of an average 
stroke patient. Patients are often elderly, suffering from additional health problems 
like hypertension or diabetes ( Sena 2007), whereas mice are young and healthy apart 
from the artificially introduced lesion to trigger stroke symptoms. Furthermore, other 
researchers identified discrepancies in the administration schedule of a particular drug 
candidate. Treatment onset occurred much sooner in animals (median ten minutes) 
than in patients (median five hours) ( Per el 2007). 

The other, more general group of quality shortcomings concerns the frequent neglect 
of study design and performance concepts in animal research, which are standard for 
clinical trials. These include random allocation of animals to test and control groups, 
blinded performance and assessment of study outcome and sample size calculation 
prior to study performance in order to guarantee a certain study power (which should 
by convention minimally be 80 to 90%) ( Dirnagl 2006, Sena 2007, Perel 2007). 
Similar issues with study quality were observed in Amyotrophic Lateral Sclerosis 
(ALS) research. Even though well-established animal models for ALS exist, 
translational success to the clinic is absent to date ( Scott 2008). Like in the case of 
acute stroke research, it was found that the general quality of study design of the 
investigated literature was similarly poor, with lack of randomization reporting, 
blinding and power discussions ( Benatar 2007). 

It is widely appreciated that study power and bias can determine the likelihood that 
any research finding is true. It is intuitively clear that small studies of weak effects 
will be predominated by false positive discovery. However, many investigators 
appear to ignore the impact on reproducibility of the number of other studies of the 
same question that one is investigating. Further, it seems likely that many biomedical 
researchers do not appropriately consider how the probability of true findings is 
affected by the ratio of true- vs. non-relationships in the total number of tests. The 
first protection from irreproducible results was the conception of the scientific 
method, which has evolved into the development of various study and statistical 
analysis designs. Each field of scholarship tends to establish a set of gold-standard 
approaches to balance true discovery and feasibility. An underlying requirement of 
this approach is the deep involvement of statisticians in the earliest steps of 
experimental design (e.g., what is the question?; what are the properties of the data?; 
etc.). Unfortunately, it is common for investigators to focus more on study logistics 
and technical aspects. Many gather full datasets before carefully considering how the 
data will be analyzed. A significant proportion of biomedical investigators lack the 
necessary statistical expertise and it is difficult to appreciate how frequently they 
receive appropriate support from others [9]. One indication that reproducibility is a 
problem is the growing trend of biomedical journals to ask manuscript reviewers to 
specifically address statistical analysis (and advise if it is necessary to implement a 
formal statistical review). Similarly, the National Institutes of Health (NIH) is 
developing such initiatives, focusing on training scientists in experimental design, 
improving evaluation of grant proposals for aspects of study plans that most influence 
reproducibility (e.g., analytical design, blinding and randomization), and access to 
published and unpublished primary data [10]. 



The above-mentioned aspects of study design are very crucial not only when it comes 
to performance, but also when it comes to reporting of studies. Additional examples 
of experimental aspects that need clear specification in publications are details about 
investigated animals, their handling and housing. Information about the sex of 
investigated animals is often missing and generally, there is a bias towards the use of 
male animals versus females, because it is (unjustifiably) assumed that male animals 
are less prone to confounding hormonal cycle deviations ( Couzin-Frankel 2014). It 
was even mentioned that the above mentioned sex bias (or better Y-chromosome bias) 
should be considered for cell studies equally ( Couzin-Frankel 2014). 

Insufficient reporting can bias the judgment of decision-makers regarding the 
translation of drug candidates from the bench to the bedside. Meta-analyses and 
systematic reviews provide powerful tools in such processes, as they estimate effect 
sizes of drug candidates by colliding all available published information. It has to be 
bom in mind, though, that only published data can be analyzed in such studies. If 
researchers do not publish their findings or if they report study details sufficiently, 
such reviews will be strongly biased. 

It has been proven, that the neglect of randomization and blinding leads to 
overestimation of effect size ( Bebarta 2003, Sena 2007). It is difficult to judge, 
though, whether researchers not reporting on randomization in fact did not perform 
randomization or whether they just failed to report on it. On the other hand, reporting 
of randomization does not necessarily mean that randomization has been performed in 
a suitable way. It was revealed that many animal researchers do not seem to be aware 
of how (fully) randomized allocation of study subjects is performed appropriately 
(< Couzin-Frankel 2013). 

Generally, it was observed that studies with positive effect sizes (and often small 
sample sizes) tend to be published more frequently than studies with neutral or 
negative outcome. This phenomenon is referred to as publication bias in literature. 
Publication bias leads to distorted results of systematic reviews and meta-analyses. In 
the case of acute stroke research, it was estimated that approximately 30% of the 
pooled effect size for a certain drag estimated via a meta-analytic approach, was due 
to publication bias ( Sena 2010). This corresponds to a substantial bias and it can be 
assumed that it this bias might reach similar dimensions in other fields of research. 

Assuming the best technical and statistical practices are applied and reported, there 
are other aspects that can affect what we think we know to be true. Such factors can 
be psychological (e.g., self-deception [13] or effects related to professional incentives 
and personal motivations [3]) or philosophical. For instance, Karl Popper proposed a 
philosophy of science that addressed his notion that one cannot be certain that 
something is true - that we can only be sure that something is false [14]. The classical 
rule of inference is modus ponens (method of affirming): “If one knows the following 
two statements “If p, then q.” and “p.” are true, then one knows that “Therefore, q.” is 
true. But this approach requires absolute conditionals and generalizations in order to 
be true; such conditions can be challenging in real life. Put another way, the argument 
can be universally valid in logic but - if at least one of the premises is false in a given 
case - unsound. The alternative inference rule, central to Popper’s philosophy of 
science, is modus tollens (method of denying): “If p, then q. Not q. Therefore, not p.” 
This is now the dominant scientific approach. Popper’s method starts with a theory 
from which deductive reasoning is used to make conclusions and predictions. 



Ultimately, critical testing of the predictions cannot prove that the theory is true; 
rather, testing can only falsify a theory. This is the reason why hypothesis-testing 
science is so dominant. 

Animal studies in anti-cancer drug development 

Failure rates of drug effects in the clinical test phase after successful animal 
experiments were reported to be highest in the field of oncology ( Hutchinson 2011). 
In 2011, the licensing success rate for anti-cancer drugs reached 5%, in contrast to 
20% for cardiovascular diseases ( Hutchinson 2011). 

Exemplarily, the situation of animal research quality within anti-cancer drug 
development shall be illustrated below. A systematic review about this drug category 
in preclinical animal experiments has revealed that less than half of all identified 
publications (n=232) were reporting on randomization (41%), blinded assessment of 
outcome (2%), allocation concealment (0%) and sample-size calculation (0.5%) 
( Martic-Kehl ). Even though many articles were published in the last decade that 
emphasized on the importance of such study design features, there was no increase in 
reporting of them observable from the late 1990ies until 2011 ( Martic-Kehl ). The only 
exception was the increased incorporation of conflict of interest statements in more 
recent articles than in older ones ( Martic-Kehl ). This phenomenon can easily be 
explained by more extended author’s guidelines of numerous scientific journals, 
which clearly request a conflict of interest statement. 

It seems that external enforcement, e.g. by journal editors, are necessary to achieve an 
improvement of reporting quality and - presumably performance quality - of animal 
studies. 

Furthermore, there was a tendency towards rather modest to inexistent drug effect for 
studies of higher quality compared to low quality studies (Martic-Kehl). 

Anti-angiogenic cancer drugs represent a striking example of how animal models can 
mislead decision-making if they are not applied in a clinically relevant way. On the 
other hand, the anti-angiogenic cancer drug research proves well how clinically 
relevant animal models can provide important information not only about efficacy, 
but also about potential harmful side effects of drug candidates. 

After the permission on the market, some evidence came up that certain anit- 
angiogenic drugs might trigger metastatic evasion of cancer cells in patients (Ebos 
2010). Retrospectively, it was found that this phenomenon could have been foreseen 
by investigating metastatic cancer models (highly clinically relevant) in mice. Primary 
cancer models did not show corresponding results (Ebos 2010). 

Similarly, the tumor location within the animal model can play a crucial role for 
predictive value as well. The easiest way of tumor inoculation into an animal and in 
addition the cheapest one is a sub-cutaneous injection at a prominent body part like 
shoulder or flank. An advantage of such locations is the easy follow-up of tumor 
growth. On the other hand, tumor cells completely grow out of their naturally 
occurring stromal conditions, which might influence their growth and reaction on 
cancer drugs crucially. For pre-clinical drug testing, it would therefore make sense to 
use orthotopic tumor models where tumors are inoculated at the organ of tumor cell 
origin. Eventually the animal species has to be considered carefully; a spontaneous 
dog or cat tumor may be genetically and behaviorally closer to human tumors than an 
induced human tumor in mice, which would not occur naturally. 



Aim of the book 

This book is dealing with animal models for cancer drug development, particularly for 
pre-clinical efficacy studies. It is thought to be complementary to the book “In vivo 
models for drug development” edited by Wiley in 2013 and with its focus on 
oncology drug development. This book aims at first to discuss all the relevant 
parameters, which are of importance to generate reliable animal results and second to 
show explicit examples of additional ways of preclinical tumor drug testing as well as 
to give practical recommendations to ensure maximal predictability of therapy 
efficacy on humans. 

In the first chapter, ethical aspects of animal experimentation are depicted, as a base 
for the use of animal models in general. 

The following three chapters contain general concepts, which are not specific for 
cancer research, but under all circumstances should be considered and implemented in 
cancer research as well. Those chapters broach the issues of study design and proper 
reporting, reporting bias and animal housing and handling. 

In the fifth chapter clinically relevant animal models for anti-cancer drug 
development are discussed in detail. 

The first subsection is discussing mouse models of advanced spontaneous metastasis 
and the discrepancies in efficacy between primary and metastasis models. This topic 
is followed by important lessons to be learned from human and canine trials to 
improve animal models in research through back translation. 

Animal models for cancer are then expanded from rodents to species like cats and 
dogs and their potential use as spontaneous cancer models in research. The use of dog 
models is deepened in detail, depicting their potential advantage and complementary 
potential compared to models of induced tumors in rodents. 
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8.4 

Dog models of naturally occurring cancer 

Joelle Fenger, Jennie Rowell, Isain Zapata, William Kisseberth, Cheryl London 
and Carlos Alvarez 


8.4.1 

Introduction 

For more than one decade, investigators have sounded the alarm that most biomedical research 
findings are false [1-4] and have asserted that this contributes to the very low success rate of 
translating preclinical research to new human therapies [5, 6]. While some of the problem can be 
attributed to aspects such as poor experimental design and analysis, it is also important to be 
aware of the limitations of specific animal models. When choosing animal models for drug target 
validation and development of therapies, it can be useful to begin by considering the weaknesses 
of the current paradigm. Two leading reasons for the frequent failure of clinical trials are i) 
insufficient understanding of the pathophysiology or the biochemistry of the targeted pathway, 
and ii) unacceptable toxicity or other side effects of therapeutic treatments. Our overarching goal 
here is to demonstrate how biomedical canine models can be used to dramatically advance our 
understanding of human biology, pathophysiology and clinical translation [7-10]. We stress that 
the canine model is not an alternative to the mouse. Rather, both models have unique strengths 
that complement each other and human translational investigation. We emphasize two concepts 
that can increase true discovery (and thus reproducibility): increasing power and reducing bias. 
Dog models achieve this by being a simplified and more homogenous version of human disease 
within breeds or breed groups, but having increased variation across breeds. 

8.4.1.1 

Animal models of human disease and the need for alternatives to the mouse 

Animal models of human biology and disease can have vastly different utilities. Comparative 
biology and genomics allow investigators to determine the extent to which human physiology is 
conserved across phylogeny. Biological models are also necessary for the execution of 



experiments that are not ethical in humans. The genetic simplification of a disease model and the 
control of as many experimental variables as possible can also be critical to biological 
understanding. The more complex a phenomenon is and the weaker the effect of individual 
contributions to a trait, the more important this is. Lastly, experimental flexibility, speed and cost 
are also prominent considerations when choosing the best organism in which to study a given 
question. In many cases, multiple organisms are used. One of the most common final aims is the 
creation of a rodent model that can be used for validation of human disease targets, mechanistic 
studies of pathophysiology and/or therapeutic studies. Presently, there are approximately a dozen 
models that have been established through characterization of their biology and the creation of 
genetic tools. Those experimental workhorses cover bacteria, protozoans, plants, yeast, nematode 
worms, flies, fish, frogs, birds and mammals [11]. 

Nobel Prize history provides a good measure of the impact of animal models on medical 
discovery [Official Nobel Prize web site, 12, see table of animal models at Foundation for 
Biomedical Research, 13]. The majority of investigations that resulted in Nobel Prizes in 
medicine (since the first in 1901) involved animal models. Since 1979 that holds for all awards 
except one that exclusively used a plant (1983, Barbara McClintock, for the discovery of mobile 
genetic elements). Most relevant to this review, there have been 29 award winners who used dog 
models (9 of those only used dogs). But if one compares the use of dogs or cats vs. mice or rats 
over time, a pattern is evident: from 1901-1973, the ratio was 35:24, respectively; from 1974- 
2013, the ratio was 7:53. Below we discuss how this gain in the use of rodent models for 
experimental investigation has recently begun to be offset by an explosion of veterinary and 
genetic research in dogs. 

Although the tradeoff between experimental tractability and human relevance is clear 
[11], investigators have tended to believe that their options of animal models are restricted. Such 
notions were historically driven by study feasibility: mainly decreased heterogeneity of inbred 
lines, and reduced costs of small animals with short generation times. Until the last decade, the 
only mammal with a full spectrum of genetic tools was the mouse. Now it is possible to mediate 
targeted mutagenesis in diverse animals [14], Similarly, in the past it was difficult to work 
outside a small number of animal models that had at least a known genome sequence and 
annotation. But now, since high throughput methodologies allow for sequencing on the scale of 
the human genome for approximately 1,000 USD, it is possible to rapidly initiate genomic 



studies in any organism. Non-human primates offer unparalleled human relevance, but have 
great limitations for some experimental methodologies, have very high costs, and carry a heavy 
or unsustainable ethical burden. While bacteria, protozoa, yeast and invertebrates tend to be very 
flexible, rapid and inexpensive, they lack the relevance of vertebrate physiology [15]. In the area 
of cancer, there are many questions where mammalian models are necessary or highly 
preferable. Among the aspects that can be unique or very different in mammals are the existence 
of the mammary gland, reproduction, gestation, post-natal growth rate, warm-bloodedness and 
many other metabolic traits, biology of the skin and lungs, and particulars of the immune system 
[16]. 

Mice have the strengths of being evolutionarily close to humans and being relatively 
inexpensive to use [17]. Dozens of inbred strains have been created. Diverse genetic and 
molecular tools and reagents have been developed. This has led to the greatest body of 
investigation available for any animal model - encompassing genetics and genomics, 
physiology, genetic and other inducible disease models, mutagenesis screens and therapeutic 
drug screens. Rodents have thus become the predominant models across essentially all disease 
areas. However, scientists using mice as biomedical and pharmacological models of human 
disease must be aware of their potential weaknesses that can impact clinical translation [5, 6, 18]. 
A major limitation that receives little attention is the lack of bio-epidemiological understanding 
[19]. Murine cancer studies very rarely involve naturally occurring cancer. Instead, most mouse 
models involve the use of highly-penetrant oncogene transgenics. The normal roles of 
heterogeneity, complex genetic and gene-environment effects, aging, and early somatic mutation 
are thus completely removed from these models of tumorigenesis. A high proportion of murine 
cancer studies also lack other key aspects of human oncological relevance such as tissue- 
specificity, drug response, recurrence, metastasis and malignancy. Many studies, such as human 
xenograph models, require the use of immunoincompetent mice and thus ignore a central 
mechanism of cancer risk and development [20]. Consideration of these various limitations of 
rodent models highlights the complementary nature of veterinary animal models. 


8.4.2 

Advantages of spontaneous cancer models in dogs 



By comparison to rodents, dogs are outbred, have more limited tools for genetic manipulation 
and are vastly more expensive to use (as they are larger and have longer generation times). That 
and trends in ethics have resulted in a relative decline in canine experimental models. 
Experimentation on dogs is widely believed to be unethical in countries where dogs are 
considered close companions (often viewed as family members) or indispensable working 
animals. However, dogs have many advantages that are becoming appreciated more and more 
broadly [9, 10]. The best known of those are powerfully-simplified genetics offered by hundreds 
of isolated breeds, 5- to 8-fold accelerated aging compared to humans, and health care levels 
second only to humans. Most major diseases of dogs are documented and many breed-specific 
risk factors are known (e.g., section 8.4.3 Dog cancer models; see Table 8.4.1& Figs. 8.4.1-2). In 
this section we highlight the unique traits that make dog models effective in the integration of 
biomedical genomics and translational medicine [10, 18, 21]. It should be noted that the various 
strengths are intimately associated and have synergistic effects on investigational power. 

8.4.2.1 

High level of evolutionary conservation with humans 

Although humans and mice are more closely related to each other (divergence time ~74 MYA) 
than they are to dogs (~81 MYA), humans and dogs are more similar [22]. Dogs are genetically 
more similar to humans than mice are, and have about 650 Mb of genome sequence that was 
present in the common ancestor of dogs, mice and humans - but which has been lost in mice 
[23]. In addition, dogs are more similar to humans at the protein level [23, 24]. The reason 
behind these facts is relatively-accelerated divergence in rodents, a complex phenomenon that is 
generally associated with animal size and other traits [25]. There are nearly 400 inherited 
diseases characterized in dogs [26]. Essentially all of these resemble a similar condition in 
humans. At least -73% of pure-bred dogs today have an inherited disease condition [27]. They 
represent all disease areas and most diseases seen in humans - including diseases of complex 
genetics such as diabetes and cancer. A caveat here is that the resources for conducting 
epidemiology of complex diseases in dogs are not in place. However, it is evident that the 
prevalence of diverse complex diseases ranges from similar to very different between humans 
and dogs. For instance, heart disease is much more common in humans. With regards to cancer, 
some have similar prevalence in both species (e.g., mammary), but others can be common in one 



species and rare in the other [e.g., osteosarcoma (OS) is common in dogs and rare in humans; 
and prostate cancer is vice versa, 28; see Table 2], In the cases where diseases have been 
dissected at the genetic level, the gene or pathway involved in dogs has been the same as in 
humans [9]. 


S.4.2.2 

Reduced heterogeneity within breeds and increased variation across breeds 

A great advantage of dog models is simplified genetics [9]. This stems from their evolution that 
involved i) a major population bottleneck in their domestication from wolves (about 11,GOO- 
16,000 years ago), and ii) a series of population bottlenecks upon the creation of breeds from 
thousands of years ago to the present [29]. The majority of the nearly 500 extant dog breeds were 
created in the last 150 years. Selection of variation was largely based on morphology and 
behavior, making dogs the phenotypically most-varied land mammals. It is estimated that only 
5% of the variation in wolves was lost in the domestication of dogs [30], However, the creation 
of each dog breed is estimated to have removed -35% of the total variation in all dog breeds. 
Thus, dogs are genetically extremely similar within breeds, but are dramatically different across 
breeds. The extent of both of those traits is very exaggerated compared to the same in the main 
human ethnic groups - such as African, Asian and European - which vary -0.1% within groups 
and -10% across groups [31]. This relative genetic-simplicity provides several strengths. Each 
breed is on the order of 100-fold genetically simpler than the full dog (or human) population. 

The hundreds of breeds can be genetically clustered into 10 or so breed groups - with many 
members that have different levels of relatedness to each other. That makes dogs ideal for rapid 
mapping of traits to broad genome regions within breeds [as the extent of linkage disequilibrium, 
or LD, within breeds is on the order of 50-100 fold greater than in humans; see 9, 32], and fine 
mapping across breeds [33], 

As a result of the evolutionary history of dogs under human domestication and their 
within-breed genetic simplification, dogs have vastly reduced heterogeneity in at least some 
complex traits. Thus the genetics and phenotypes are vastly more homogenous, and these two 
aspects synergistically increase the power of genome scanning of complex traits. The fact that 
dog breeds have extreme variation across diverse traits of all types further increases the strength 
of dog models for biomedical research. This can also be used to avoid biases common in studies 



of single breeds, strains or species. As an example of heterogeneity in humans vs. dogs, let’s 
consider the trait of height, which has a high level of heritability in both species (-80% for 
humans). A meta-analysis of 46 human genome wide association studies (GWAS’s; discovery 
n=133,653; replication n=50,074) for height uncovered 135 associated markers that explained 
-10% of height variation [32]. In contrast, multiple studies of hundreds of dogs from dozens of 
dog breeds revealed six genome loci ( IGF1, IGFR1, GHR, SMAD2, HMGA2 and STC2 ) that 
account for at least 53% of the size variation across all but giant breeds [34], One study of 57 
morphological traits in 915 dogs from 80 breeds showed that the majority of phenotypic 
variation for most of those traits could be explained by 1-3 loci [35], A prominent question today 
is whether the genetic architecture and effect sizes of associated variants of canine diseases of 
complex genetics will be similar to that of canine morphological traits, but it is too early to 
generalize about this. 

In the area of complex disease traits, a comparison between human and dog OS risk can 
be made. A large consortium of investigators conducted a multistage GW AS using 941 human 
OS cases and 3291 controls, and identified two genomewide significant loci ( GRM4 , Odds Risk, 
OR, of 1.57; and a gene desert on 2p25.2, OR 1.39) [36]. In contrast, Karlsson, Lindblad-Toh et 
al. conducted GWAS’s of OS risk in three separate dog breeds (total n=543) and identified 33 
genome wide significant loci that included prominent cancer genes CDKN2A/B, AKT2 and BCL2. 
These were estimated to explain 57%, 53% and 85% of the phenotype variance in Greyhounds, 
Irish Wolfhounds and Rottweilers respectively [see section, 8.4.3.2.2 Osteosarcoma; 37]. The 
highest observed OR’s for canine OS in those three breeds that all have a high prevalence of the 
disease were 1.36, 1.43 and 1.75. This could suggest the presence of additional loci in each breed 
that could not be mapped because the breed is fixed for the risk allele. This was found to be the 
case for seven of the loci, which were fixed in one or - for one locus - both other breeds. Other 
than that, there was no overlap between the OS loci in the three breeds, which included two 
related sighthounds. These observations represent an important caveat that has to be considered 
in the design and interpretation of dog studies. The reduced heterogeneity of breeds is associated 
with high levels of homozygosity that cannot be assayed in genetic mapping studies. Those 
regions may contain variation that contributes to essentially any trait. Additionally, the use of 
breeds with greatly reduced heterogeneity suggests that the findings will not necessarily be 
directly applicable across breeds. Two ways to mitigate this issue, and perhaps change it to a 



strength, are i) comprehensive genotyping of all genetic variation (SNP and CNV; discussed 
below), and ii) analysis of multiple breeds, ideally including ones that are not closely related. 


8.4.2.3 

Potential for comprehensive genotyping 

Besides the challenge of very high heterogeneity, another limitation to human complex genetics 
is the fact that mapping studies are not able to query a major portion of variation - mainly 
structural variation (SV) or DNA copy number variation (CNV). Structural variation is that 
which alters the structure or content of DNA, such as duplications, deletions, inversions and 
translocations; CNV refers to sequence that has variable copy number (arbitrarily defined as at 
least 50 bp), such as gain or loss structural variants [38]. SV/CNV are distinct from somatic 
genomic alterations because they are restricted to germline polymorphisms (with an arbitrarily 
established minor allele frequency commonly set at 1%). The extent of CNV as measured by 
number of variant base pairs in humans is 100 bases of CNV per single nucleotide variant (SNP). 
However, it is technically very difficult to genotype the majority of SV/CNV. Most GWAS’s 
only capture a minute portion of SV/CNV, predominantly very large CNVs. Because much of 
SV/CNV involves repetitive sequences of diverse types, it is often difficult or impossible to 
develop specific genotyping assays. Whole genome sequencing in humans results in 
approximately 25% of the reads falling outside reference-templated assemblies. Those 
unmappable reads are almost entirely non-unique sequences that make up much of SV/CNV. 

Because of the reduced genetic variation within breeds, dogs are ideal mammals in which 
to catalog all heritable variation in order to develop genotyping platforms that can detect most 
variants [39-41], This could be done by identifying all SV/CNV and sequence variants in 
specific breeds, then creating an integrated map of those [18, 39, 40]. The use of novel long 
distance sequencing and analogous methods would allow determination of a high-quality phased 
assembly of breed-specific genomes. We estimate that the first version of this could be done with 
approximately 10-20 dogs of one breed. This would yield a set of tag SNPs that could detect 
most discerned haplotypes along with their known SV/CNVs. Such a SNP platform could be 
used for genome scanning or integrative genomics that incorporated an annotated catalog of all 
breed variants. The annotation of variants could include scoring of variants for likelihood of 
having functional or pathological effects. Annotation could also include classification of variants 



into biochemical or genetic networks, or according to other types of biological relevance. Such 
annotation would yield a list of gene variants that may be implicated in diseases that are common 
to the breed under study. Implicated variants could be directly tested for association with that 
disease in that breed. Notably, this information would also be useful to consider the possible 
effects of candidate functional-variants that are present in genome regions that are fixed, or 
nearly fixed, and cannot be queried in genetic mapping studies [42], 

8 . 4 . 2.4 

Understanding both somatic and germline cancer genetics 

Most of the strengths of dog models are relevant across all human disease areas, but there are 
aspects of cancer that differ from all other diseases [18]. Among those are the facts that cancer i) 
involves complex germline-risk genetics and somatic mutations, and ii) is also associated with 
broad types of environmental stimuli that include drugs and alcohol, diet, sunlight and diverse 
microbes. Another distinguishing characteristic of cancer is that it affects myriad cell types in 
essentially all tissues, and can involve almost any biochemical process of the cell. The recent 
explosion of molecular-genetic knowledge of cancer can be summed-up best as the realization 
that it is vastly more heterogeneous and complicated than we had ever imagined. This problem of 
tumor (including intra-tumor) heterogeneity is true within and across cancer types. Given those 
biological realities, canine breed models present powerful opportunities of simplified genetics 
and reduced disease heterogeneity. Compared to humans, it is vastly simpler to map traits and to 
conduct translational studies in dogs. 

The understanding of human cancer genetics has exploded in the recent past. That 
exponential advance was made possible by the development of high throughput sequencing and 
its application to the sequencing of tens of thousands of tumor genomes, exomes or 
transcriptomes [supported by other ‘omics methods: DNA methylation, comparative genomic 
hybridization, chromatin IP, small and long non-coding RNA, proteomics, metabolomics, etc.; 
43]. This has led to the identification of the majority of somatic driver genes in all common 
human cancers. Such understanding is very important to be able to go on to stratify cancers for 
biological dissection and therapeutic relevance. However, knowledge of germline cancer 
genetics in humans is almost a black box [7]. The reason for this is that the heterogeneity of 
human populations makes it impossible to map most genetic variation associated with complex 



diseases such as cancer. Even for cancers that have been studied many times in the same general 
population - e.g., breast and prostate cancer in Western Europeans - only a small portion of the 
heritability of those cancers is explained. Furthermore, human complex genetics is at a very early 
stage and the vast majority of mapped associations are only defined as a general location - 
without knowledge of the causative variant (and any hint of disease mechanism) [44], This 
complexity of common diseases is compounded by environmental risk factors. It is clear that our 
understanding of common diseases is embryonic. We must not only map gene variants, but will 
have to learn their biology, their interaction with other gene variants and their interaction with 
the environment (including with diverse microbes that are evolving rapidly). 

Many of the successes in addressing the gap of in our understanding of human germline 
disease risk came from studying multigenerational families and isolated populations, and these 
continue to be important [45]. Such populations include the Finish, Icelanders, Utah Mormons, 
Amish, and Ashkenazi Jews. Examples of early contributions relate to the first breast cancer 
gene, BRCA1, which was mapped by family studies [46], identified in Utah Mormons [47] and 
found to reveal important insights in the Ashkenazi Jews [48] among other populations. 

However, neither this nor the new approach of sequencing-based discovery of “rare variants of 
strong effect” is suited for identifying most common variation associated with worldwide cancer 
risk. To date the only other alternative has been to increase the power of GWAS’s in broader 
populations. It is no longer uncommon to have studies with 10,000 cases and as many or more 
controls. And these studies are still not revealing the majority of disease associated variants or 
heritability for complex diseases [49]. In terms of genetic simplicity, natural cancer in inbred 
rodents offers a way to address the gap in our understanding of germline cancer genetics. Such 
work is relatively rare in unmodified rodents because it would require great resources due to 
incomplete penetrance of risk variants and the requirements of advanced age. Interpretation of 
those studies is also complicated by the same reduced complexity that is desired: is the simplicity 
so great that the rodent strain is anomalous with regard to cancer risk? Rodent strains are selected 
for viability and fecundity in the presence of very little genetic variation - how are these lines 
different from rodent or human individuals and populations? In addition, the lifestyle and 
medical contexts of human cancer are very different from the laboratory setting of rodent cancer. 

Pet and working dogs are attractive subjects for studying germline cancer genetics 
because they are naturally susceptible to most cancers present in humans, share environments 



with their human companions, exist in a medical context and are genetically highly-simplified 
[see Table 8.4.3; 18, 24], As with rodent lines, the knowledge gained from single breeds may not 
be generalizable across the population. This can be addressed by studying multiple breeds from 
different breed groups. Deep understanding of canine germline and somatic genetics associated 
with specific cancers and breeds - e.g., OS in Greyhounds and Rottweilers [37] - will lead to 
unprecedented knowledge of gene-gene and gene-environment interactions in a mammal. 
Another promising approach is to test for canine cancer-risk pathways in humans. Today this is 
complicated by lack of broad genotype data for humans, but it is likely that this will change in 
the next decade. Variant genes that are associated with canine cancer risk can be tested for direct 
association in humans, but they can also be used to implicate biochemical and genetic networks 
that can be assessed for human gene-gene/-environment interactions. Dogs could make major 
contributions in the latter area. There is very intense interest in this, but it is computationally 
impossible to test all potential combinations in large human datasets using conventional 
approaches at this time [50], Canine models thus provide an opportunity to develop an 
understanding of germline contributions to diverse cancer traits. A complete picture will emerge 
when that is combined with somatic variation and environmental data collected in the veterinary 
setting. 


8 . 4 . 2.5 

Translational models 

The same strengths of the canine model that set it apart from rodent and human studies - genetic 
simplicity in a natural population with highly advanced clinical care - also offer a unique 
solution to the gap between therapeutic research and development [7]. In dogs, it is possible to 
identify the germline and somatic cancer-drivers, and then to directly test targeted therapies on 
the same species. Such proof of concept studies could dramatically reduce the cost and time 
compared to similar studies in humans [10]. Below, we give examples of how dogs are 
specifically relevant to two human cancers and how the canine model is already being exploited 
for rapid development as a cancer model. 


8 . 4.3 

Dog cancer models 



8 . 4 . 3.1 

Canine cancer incidence 

Comparative oncology integrates the study of naturally occurring cancers in animals with the 
study of human cancer biology and therapy [51]. Studying dogs with spontaneous cancer 
provides a valuable perspective distinct from that generated with other animal models because 
dogs naturally develop cancers that share similar characteristics to that of their human 
counterpart [8]; and this occurs in a population that, for the most part, is sharing many of the 
same environmental exposures with people. To this end, studies of dogs with cancer conducted 
over the past 30-40 years have significantly advanced the understanding and practice of human 
oncology in fields such as basic tumor biology [52, 53], tumor immunology [54, 55], radiation 
biology [56, 57], surgical oncology [58, 59], and systemic therapies [60] for a variety of cancers 
including OS, lymphoma, melanoma, and others. 

Canine health foundations estimate that in the USA, -8.5% of the canine population will 
be diagnosed with cancer this year (6 Million new cases/ 69.9 Million dogs) compared to -0.5% 
of the USA human population [1,665,540 new cases in 2014/318,951,181 US population; 57, 58, 
61, 62], 27% of all dogs die from cancer, making it the leading cause of death [63]. Cancer is the 
leading cause of death in dogs over 10 years, with 50% of older dogs developing the disease [63- 
65]. Advances in the care of pet animals such as in better nutrition, vaccination for common 
infectious disease, leash laws that reduce automobile deaths, and the availability of more 
sophisticated veterinary care has allowed dogs to live longer. Those factors have increased the 
population of dogs at risk for the development of cancer and other age-related diseases. The 
rising prevalence of cancer in the pet animal population may be the result of an actual increase in 
cancer incidence. However, this may also reflect an increasing interest of owners to seek out 
advanced veterinary care or to participate in research studies [66]. 

Cancer in dogs and humans share many features including histologic appearance, tumor 
genetics, molecular pathway alterations, biological behavior, and similar responses to traditional 
treatment regimens such as surgery and chemotherapy (see Table 8.4.4). Similar environmental, 
nutritional, age, sex, and reproductive factors are associated with tumor development and 
progression in human and canine cancers. Pet dogs share the same living environment as their 
caregivers, potentially serving as epidemiologic or etiologic sentinels for the changing patterns 
of cancer development seen in humans [67-72], Importantly, spontaneous cancers in pet dogs 



recapitulate the biological complexity of human cancers in that they occur in the presence of an 
intact immune system and are characterized by tumor growth over long periods of time, inter¬ 
individual and intra-tumoral heterogeneity, development of recurrent or resistant disease, and 
metastasis to relevant distant sites [8]. 

The spectrum of cancers seen in humans is similarly diverse in dogs, including OS, NHL, 
melanoma, soft-tissue sarcoma, mammary, lung, head and neck, and bladder carcinomas [65, 73- 
78]. However, the incidence can vary dramatically between the two species [28]. While the most 
common human cancers are various epithelial types, most of those are relatively rare in dogs and 
the latter have higher rates of lymphomas and sarcomas. There is now a growing body of 
evidence from cross-species genomic analyses that demonstrate significant similarities between 
genomic profiles in canine and human cancers, providing support for the notion that these 
diseases are similar at a molecular [see section 8.4.2 Advantages of spontaneous cancer models 
in dogs; 52, 76, 79, 80-82]. For the more commonly studied cancers (e.g. OS and NHL), gene 
expression profiles and genomic alterations associated with cancers in dogs are highly analogous 
to those identified in their human counterparts [53, 73, 83-85]. Additionally, specific 
biochemical pathways known to be drivers in human cancers are also observed in various types 
of canine cancers, offering the opportunity to therapeutically target those mechanisms in dogs. 
That knowledge can then translate the findings more globally to human cancers driven by the 
same cancer-causing gene mutations. For example, approximately 30% of high grade canine 
mast cell tumors possess activating mutations in the tyrosine kinase receptor gene KIT (aka, c- 
Kit) which is associated with increased risk for the development of metastatic disease and shorter 
overall survival times [86-88], While mast cell malignancies are uncommon in people, similar 
KIT mutations are found in human gastrointestinal stromal tumors (GIST) [89]. Clinical trials 
investigating the safety and efficacy of novel tyrosine kinase inhibitors targeting KIT in dogs 
with mast cell tumors have provided meaningful data correlating target inhibition with the 
mutational status of KIT and drug plasma concentrations, aiding in the translational development 
of this class of agents for KIT-driven malignancies in humans [90-92], 

Partly due to the tremendous genetic advances (sequencing of the canine genome in 2005, 
development of methods to capture genetic variations on a high-throughput genomewide scale, 
etc.), the last 10 years have seen an explosion of canine cancer research. In fact, the publications 
related to canine cancer from 1970-2000, had an annual growth rate of 1.33% (total growth of 



-40%). Compare that to 2001-2013 where the annual growth rate was 8.14% (total growth of 
-98%; see Fig. 8.4.3). In the past, the predominant focus of canine cancer research was on the 
most common cancers in dogs, some of which were rare in humans (e.g., OS). Presumably 
because mammary cancer is rare in spayed dogs, comparative oncology of canine mammary 
cancer has been relatively limited. However, there has been a recent surge in study of canine 
models of breast cancer. As with humans, mammary tumors in dogs are the most common tumor 
type in females and are significantly associated with somatic mutations in BRCA1 and BRCA2 
[93]. Additionally, recent research has recognized the many similarities regarding the 
morphology, biological behavior, and clinical course of mammary tumors in both species [94], 
We thus expect the study of dog mammary cancer to grow rapidly. Below we focus on two types 
of cancer for which the dog arguably recapitulates the human disease better than any other 
animal due to natural disease, increased genetic variation compared to inbred rodents, and the 
relative richness of epidemiological and clinical data. We highlight the potential of the canine 
model to vastly accelerate our understanding of cancer genetics and treatments. 


8 . 4 . 3.2 

Genetics of breed-specific cancer models 

8 . 4 . 3 . 2.1 

Lymphoma 

Lymphoma refers to a heterogeneous group of disorders involving monoclonal proliferation of 
malignant lymphocytes [95], Human lymphoma is commonly divided into two categories: 
Hodgkin’s Lymphoma (HL) and Non-Hodgkin’s Lymphoma (NHL). Together, 5-year survival 
rates for lymphoma are very high, ranging from -70-90% [95, 96]. HL is characterized by the 
presence of a Reed-Stemberg cell and is typically subdivided into classical HL and nodular 
lymphocyte-predominant HL [96], HL is rarer than NHL but has a better 5-year survival rate. 
Alternatively, NHL includes more subtypes then HL and the literature has reported an increasing 
incidence within the past 30 years [95] that has appeared to stabilize more recently [97]. NHLs 
represent 5% of all new cancer cases and are the fifth leading cause of cancer death, and the 
second fastest growing cancer in terms of mortality. Incidence rates of NHL are particularly high 
in western societies and for most subtypes of NHL, higher in men than women [98-100]. In dogs, 
NHL accounts for approximately 10% of all malignant tumors [83% of all haematopoietic 



malignancies; 53]. NHLs have been directly compared in humans and dogs many times, yet our 
understanding of the known canine incidence still comes from the 1970s [78], Similar to humans, 
the proliferative activity of individual NHL can vary tremendously, with tumor mitotic activity 
and miRNA diversity serving as possible clinical indices of aggressiveness in dogs [101, 102], 
Further, activating pathway processes in lymphoproliferative diseases are shared between the 
two species [101, 102]. There is evidence suggesting that the tumor microenvironment, 
activating pathway processes, clinical, cytological, and immunophenotypic properties are similar 
in the dog providing a robust model of the human disease [101-106]. 

The most common type of NHL is diffuse large B-cell lymphoma (DLBCL), representing 
over 44% of Lymphomas [107], B-cells are lymphocytes that confer efficient and long-lasting 
adaptive immunity by the generation of high-affinity antibodies against microbial antigens [108]. 
These cells form an essential part of the humoral immune response and play a central role in 
overall immune logic [108]. The World Health Organization (WHO) recognizes 4 subtypes of 
DLBCL: 1) DLCBL not otherwise specified (NOS), 2) DLCBL with predominant extranodal 
location, 3) large cell lymphoma of terminally differentiated B-cells and 4) Borderline cases 
[109, 110]. However, despite these classifications, diagnosis is complex and often does not 
provide a clear homogenous subtype and topographical pattern [111]. The degree of variability 
within histologically identical DLBCLs prompted attempts to molecularly subcategorize the 
tumors [112], Subsequent to the WHO categorization, a seminal work, using gene expression 
profiling (GEP), identified that DLBCL-NOS (the largest grouping of DLBCL) could be divided 
based on well-defined genetic signatures that have clinically predicative utility that has been 
replicated multiple times [112, 113], Activated B-cell (ABC), germinal center B-cell (GCB), and 
primary mediastinal B-cell lymphoma (PMBL), are grouped based on the differentiation and 
maturation of distinct B-cells at separate stages [114], Survival for 5-years based on subtype is 
roughly 30%, 59%, and 64%, respectively [112, 115], 

By far the most common molecular subtype of DLBCL, GCB DLBCL typically occurs in 
children and young adults [116]. The GCB subtype is characterized by a much more favorable 
outcome and a spectrum of genetic aberrations, which include the t(14;18)(q32;q21) 
translocation, deletion of PTEN, amplification of the miR-17-92 cluster, or TP53 mutations 
[117]. Expression patterns associated with the GCB subtype include markers of germinal center 
differentiation, such as CD 10 and BCL6 [117]. While BCL6 is highly expressed in the GCB 



subtype, it is rarely expressed in ABC DLBCL. This transcription repressor has many critical 
roles that contribute to innate and adaptive immunity. Under normal physiological conditions, 
germinal centre B-cells share some characteristics of tumor cells including that they proliferate 
rapidly, evade growth checkpoint controls, and tolerate ongoing genomic instability [118-120]. 
During the humoral response, BCL6 functions as a master regulator of the GCB phenotype. 
Germinal Centres emerge in the secondary lymphoid organs upon B-cell activation and provide 
the setting for massive clonal expansion and immunoglobulin somatic hypermutation (SH). 
Immunoglobin SH is an adaptive measure that allows mutations in the variable regions of 
immunoglobin genes to provide more antibody affinity maturation [See review by 118]. 

Although this is essential to permit immunoglobulin mutagenesis and maturation, germinal 
centre B-cells are prone to malignant transformation. B-cells that generate high-affinity 
antibodies are then selected to undergo terminal differentiation to memory cells or long-lived 
antibody-secreting plasma cells [see Figure 8.4.4; 118]. 

In contrast, the ABC subtype has significantly worse survival and is characterized by a 
distinct genetic background that includes the t(3;14)(q27;q32) translocation, trisomy 3, deletion 
of the 1NK4A-ARF locus, BCL2 amplification and constitutive activation of the nuclear factor 
kappa B (NF-kB) pathway with high expression of NF-kB target genes [117, 121]. Interestingly, 
both GCB and ABC express BCL2 (oncogene), although there is a four fold higher expression in 
most ABC DLBCLs compared with GCB DLBCLs [113, 121]. In addition, ABC lymphomas 
arise from cells that have completed SH and therefore contain static immunoglobulin heavy 
chain variable region sequences [122], ABC DLBCL centers on MYD88, the key-signaling 
adapter in the Toll receptor pathway. Somatic mutations in the MYD88 TIR domain occur in 
39% of ABC DLBCL tumors, and are the most frequent oncogenic lesion described in this 
subtype [123]. 

Canine DLBCL is an aggressive malignancy demonstrating significant overlaps with the 
human disease and is considered curable in less than 10% of dogs [124], Just as in humans, 
DLBCL is the leading identified histotype (44.4%) and is of primary B-cell origin [82, 107]. One 
work explored the subtypes of DLBCL NOS in dogs, comparing their findings with human 
tumors. Similar to humans, differential expression split the groups into ABC-like from GCB-like 
[82]. Furthermore, these canine-specific "ABC/GCB" discriminating genes, while different from 
the human “ABC/GCB” gene list, were involved in the same pathways and processes (e.g., NF- 



kB signaling and B-cell receptor signaling). And using the human gene list, canine and human 
lymphomas clustered according to subtype, not species [82], Recently, NF-KB expression in 
DLBCL was compared using a principle components analysis and hierarchical clustering. 

Human and canine DLBCL grouped together and separately from the human and canine healthy 
tissue [53]. A clinical trial of dogs showed that using a peptide that effectively blocked 
constitutive NF-kB activity in vitro markedly reduced the mitotic index in 50% (3/6) of the ABC 
DLBCL canine participants [125]. 

Interestingly, BCL6 (which is highly expressed in GBC DLBCL) is rarely expressed in 
either canine DLBCL subtype and this has been confirmed by others [82, 125, 126]. Collectively, 
these studies identify molecular similarities in human DLBCL that suggest pet dogs are a highly 
representative model of DLBCL for future studies, including therapeutic clinical trials [82], It 
also provides evidence that dogs do have unique aspects of tumor biology that, as our 
understanding develops, will improve knowledge of cancer initiation, progression, and 
maintenance. In addition to the BCL6 difference in expression, clinical case studies abound with 
variations in the traditional presentation of lymphomas in dogs, only highlighting the complexity 
of this disease and potential for dogs as a model [127-130]. Further, findings in canine DLBCL 
can provide new targets for the human disease. For instance, tissue factor pathway inhibitor 2 
(' TFPI-2 ) is a tumor suppressor involved in invasiveness inhibition that has been explored in 
many cancers, but not lymphomas [124], Recently, a study that looked at epigenetic silencing in 
canine DLBCL found high-frequency epigenetic dysregulation of TFPI-2 that correlates with its 
reduced mRNA expression [124]. This provides one clear target for human analysis. 


8 . 43 . 2.2 

Osteosarcoma 

Dogs have served as an epidemiological and pathological model of OS for decades. However, it 
is only more recently that the full potential of the canine application to human OS has been 
recognized - particularly in its relevance to genetics and biomarkers [comprehensively reviewed 
in 73], OS is a high-grade primary bone neoplasm of mesenchymal origin [131]. The parallels 
between canine and human OS are significant in their clinical presentation, biologic behavior, 
histology, conventional treatments, and shared biological targets for investigational treatments 
[132], In humans, primary bone tumors are rare and account for only 0.2% of all malignant 



tumors [133]. OS has a bimodal age distribution that peaks at 15 to 19 years and ~70 years [16] 
with approximately 60% of tumors occurring in patients under 20 years of age, and -10% in 
patients older than 60 years [134], For the majority of cancer diagnoses, 77% are made in 
persons > 55-years of age [135]. However, OS is the third most frequent cause of cancer in 
adolescents and represents over 56% of all bone tumors. The highest incidence in the US occurs 
in African American and Hispanic/Latino adolescents, and the lowest incidences are in non- 
Hispanics of European descent, Asians and Pacific Islanders [136]. Encouragingly, although the 
incidence of OS has not changed appreciably since 1975, the overall mortality has decreased 
[136]. Children aged birth-19 years diagnosed with OS from 1975-1979 had a 5-year survival 
rate of 45% [136]. With experience and multiple clinical trials, a chemotherapeutic regimen was 
defined that resulted in an increase in overall survival to 71% of those diagnosed between 2003- 
2009 [136, 137]. Now, those who survive 5-years have a 90% chance of surviving to at least 15- 
years [136], However, new regimens in humans have failed to further improve outcomes since 
1987 [137]. Thus, the canine model of OS is poised to help develop new therapeutic targets and 
improve outcomes. Dogs have already had a significant impact on OS as it was in dogs that the 
first limb-sparing techniques were developed [10] The incidence in dogs has been estimated to 
be at least 13.9/100,000 [10], but also reported as 52/100,000 [138] compared to actual US 
incidence of 0.947/100,000 in humans in 2014 [3,020 Cases/318,673,000 USA Population=0.947 
Cases Per 100,000 individuals; 139, 140, 141], 

A key clinical difference between humans and dogs is the peak onset of the disease in 
humans is at a time when growth is rapid, whereas in dogs, the peak incidence is later in life after 
closure of the growth plates [132]; however, a bimodal overlapping incidence is shared by the 
two species [10, 142], OS primarily affects the metaphyseal of the weight-bearing regions of the 
long bones of the appendicular skeleton (shoulder girdle, arms, legs, etc.) accounting for 72% 
and 94% of OS in dogs and humans with the small remainder of tumors arising in the axial 
skeleton [including the flat bones of the skull, ribs, vertebrae, sternum, and pelvis; 122, 131, 139, 
143-145]. Interestingly, proximal humeral location is a significant negative prognostic factor in 
both canine and human appendicular OS [73, 146]. Other prognostic factors associated with 
survival of OS shared by dogs and humans include tumor grade, tumor mitotic index, presence of 
metastasis, the use of adjuvant and/or neoadjuvant chemotherapy, postoperative infection at 
limb-sparing surgical sites, age of onset, and high serum alkaline phosphatase (ALP) and lactase 



dehydrogenase (LDH) [73, 142, 143, 146], In both species, a slight predominance in males over 
females has been reported [73, 142]. 

The exact etiology of OS remains obscure primarily because of the difficulty in isolating 
a pure population of osteocytes from residence within a mineralized matrix [now thought to be 
the mechanosensory cell in bone which has a major role in the regulation of bone formation and 
resorption; 147], Recent work suggests that osteocytes can serve as OS progenitors as identified 
by expression of dentin matrix phosphoprotein 1 ( DMP1 ; a marker of osteocytes) in multiple 
murine, human, and canine OS cell lines [148]. To assess the tumorigenic potential of osteocytes, 
an immortalized murine osteocyte cell line was injected into two sites on mice and both locations 
developed tumors consistent with OS [148], The strength of this etiology work will continue to 
be explored using multiple models, as will more traditional genetic research. These various 
approaches will lead to the validation of targets for clinical trials and pharmacogenetics. 

Although expression of canine and human OS had been previously considered separately, 
the first study to consider both human and canine OS expression together replicated former 
classifications of short/long-term survival in another set of dogs and in 5 independent human 
datasets [52, 149]. Because of these similarities, dogs make an outstanding pre-human trial 
clinical model for the targeting of pharmacogenetic therapies [150]. For example, a potential 
clinical target is Survivin, a gene whose expression is elevated in human and canine cancer [151]. 
This gene is a member of the inhibitor of apoptosis (IAP) gene family, which encodes negative 
regulatory proteins that prevent apoptotic cell death [152], Survivin expression has been reported 
to be an indicator of poor prognosis, low apoptotic index, poor differentiation, high proliferation 
activity, and enhanced resistance to cisplatin-mediated chemotherapy [153]. Survivin attenuation 
in canine OS cells demonstrated inhibition of cell-cycle progression, increased apoptosis, mitotic 
arrest, and chemosensitivity to significantly improve in vivo tumor control [154], 

Another clinically relevant target is the Notch signaling cascade. This pathway is crucial 
for the development of multiple organ systems including bone development and both osteoblasts 
and osteoclasts [155]. In human OS, the Notch target gene HES1 is associated with cell 
proliferation, invasion and metastasis, and has been suggested to be a prognostic biomarker 
[156]. Some of these findings were replicated in dogs, but interestingly, there appears to be 
reduced HES1 expression despite elevated expression of other Notch signaling targets in the 
most aggressive OS tumors [157]. These findings indicate that HES1 expression, while a good 



prognostic indicator, may not be an appropriate surrogate marker of Notch signaling [157], Such 
findings are invaluable to future translational work and might have been missed without use of 
the canine model of OS. Thus far, shared aberrant gene expression in both canine and human OS 
includes p53, RBI, PTEN, HER-2, MET, STAT-3, mTOR, ezrin, PDGFs/PDGFRs, MMP2/9, 
miR-134, miR-544 [158-168, for a list of developmental pathway abberations in OS see 169]. 

OS in companion animals is generally considered a disease of large and giant breed dogs 
[63, 170]. A hallmark study of canine OS in 1966 reported an OR of 185 for giant breeds (> 36 
kg), and an OR of 13 for medium sized dogs (18-36kg) when compared to dogs weighing <10kg 
[171]. A follow-up study in 1998, conducted on 1462 cases of canine OS, found that -30% of all 
OS cases were attributed to dogs weighing > 40 kg and only 5% of their tumors occurred in the 
axial skeleton [17]. Coupled with this, only 5% of OSs occurred in dogs weighing <15 kg, but 
-60% of their tumors originated in the axial skeleton [17]. This suggests that determinants of OS 
may differ for large and small breeds, although increasing height and weight are the most 
predictive factors for OS in dogs [172]. Interestingly, height and weight have been found to be 
significant risk factors in human OS as well [145, 173]. In fact, compared to average birth- 
weight subjects (2,665-4,045 g), individuals with high birth-weight (> 4,046 g) had an increased 
risk of OS (OR 1.35) as did individuals >51 st height percentile (OR 1.35) [174], 

Insulin-like growth factor (IGF1) and Insulin-like growth factor 1 receptor (IGF1R) have 
been associated with height and weight. It was shown that IGFI null mice are 40% smaller than 
littermates, while IGF1R null mice are approximately 55% smaller and die at birth [175]. In 
dogs, a non-synonymous SNP (chr3:44,706,389) in IGF1R changes a highly conserved arginine 
at amino acid 204 to histidine and is predicted to prevent formation of several hydrogen bonds 
within the cysteine-rich domain of the receptor's ligand-binding extracellular subunit [176]. This 
SNP is associated with breed average height at withers of < 10 inches and -10 lb weight [34, 

176] and deletion or mutation in humans causes severe short stature or non-syndromic children 
that are small for their gestational age [177-179]. Further, a single IGF1 SNP haplotype is 
common to all small breeds and nearly absent from giant breeds, with the size of different breeds 
correlating with reduced IGF I plasma levels [180, 181]. 

IGF1R expression in human OS has been associated with tumor metastasis and poor 
prognosis, and represents an appealing therapeutic target [182], Investigation of IGF1R 
expression in canine OS tissues and cell lines showed that IGF1R was expressed in 71% of the 



samples and that dogs with higher levels of IGF1R expression (47% of cases) had significantly- 
decreased survival when compared to dogs with lower IGF1R expression [183], The expression 
of the 1GF1R gene is negatively regulated by a number of transcription factors, including the 
WT1 and p53 tumor suppressors that are known to be aberrantly expressed in both canine and 
human OS [184]. IGF1 has also been associated with a genetic susceptibility for OS [185]. The 
role IGF1 and IGF1R play in growth, development, and OS is clearly important but still unclear 
[reviewed in 180]. This evidence suggests a crucial role for genes that govern growth in OS 
development. The dog provides a powerful model from which to uncover these relationships; for 
example, the average weights of Chihuahuas and English Mastiffs differ by 65-fold. 

As discussed above (8.4.2.2), the first GWAS’s for canine OS susceptibility were 
recently conducted on the Greyhound, Rottweiler, and Irish Wolfhound [42], A total of 33 loci 
were identified and top candidates in each breed included prominent cancer genes such as 
CDKN2A/B, AKT2 and BCL2. The top Greyhound candidate locus was fine-mapped to 15kb 
between CDKN2B-AS1/ANR1L and CDKN2AJB. An enhancer screen in the human OS U20S cell 
line then narrowed the causative variant to a highly-conserved single nucleotide predicted to lie 
within a PAX5 binding site. The latter transcription factor offers an excellent candidate 
mechanism as it is a known regulator of bone biology. Fortuitously, the causative variant is the 
peak SNP marker in the Greyhound GWAS, and thus much evidence already exists for its allele 
frequency in diverse breeds. It is fixed in Rottweiler (0.97 frequency) and Irish Wolfhound 
(0.95), but present in lower frequencies in a panel of 28 different breeds. In a group of eight large 
(a major OS risk factor) and high OS-risk breeds (Rottweiler, Irish Wolfhound, Leonberger, 
Pyrenees, Mastiff, Labrador Retriever, Great Dane, and Golden Retriever), the risk allele is 
slightly more common in cases. In the Leonberger and Great Pyrenees it is weakly correlated 
with OS, but no association was found in the Mastiff, Labrador Retriever, Great Dane or Golden 
Retriever. Curiously, the OS risk variant is present in approximately 50% of several dozen 
breeds studied to date, but has not yet been detected in the more limited numbers of wolves that 
have been genotyped to date. As the CDKN2A/B locus is a pan-cancer tumor suppressor, it will 
be interesting to learn whether the dog risk variant affects OS or broad cancer risks in mice. This 
study hints at the power of combining germline and somatic genetics to identify pathophysiology 
of dog cancer. This type of deep biological understanding will lead to the development of 
targeted therapies and their translation in the same species. 



8 . 4.4 

Preclinical and veterinary translational investigations in dogs with cancer 
8 . 4 . 4.1 

Preclinical investigations in dogs with spontaneous cancer 

The development and approval of new cancer drugs is a lengthy, costly, and frequently 
unidirectional process [186, 187], Novel agents are assessed in conventional preclinical models 
of efficacy and toxicity prior to moving into human clinical trials; however, most new cancer 
drugs that enter human clinical trials fail to reach approval, largely because of unanticipated 
toxicity or lack of efficacy that was not predicted in rodent models of cancer [188, 189]. Based 
on our knowledge of the complexity of cancer, it is not surprising that many models fall short of 
being predictive. Translational studies investigating new drugs, devices and imaging techniques 
in pet dogs with cancer can overcome many of these shortcomings, and assist in the transition 
between conventional preclinical models and human clinical trials. 

Mouse models of cancer have proven to be excellent tools for dissecting the biology of 
molecular pathways involved in cancer development and progression; however, they frequently 
do not truly recapitulate the biological features that define cancer in humans, including genomic 
instability and the heterogeneity of tumor cells within a complex microenvironment [190]. 
Furthermore, conventional mouse models fail to recapitulate the complex biology of cancer 
recurrence and metastasis integral to outcomes in human patients. Naturally occurring cancers in 
dogs have an intrinsic advantage as a model for human disease in that they mimic and represent 
biologically complex conditions in a way that is not possible using other animal models. In many 
cases, cancers in dogs are described in the same language as their human counterpart and can be 
classified according to histologic and/or clinical staging systems analogous to that used in human 
cancers [e.g. National Cancer Institute Working Formulation, World Health Organization 
histopathological classification and clinical staging system for domestic animals with lymphoma; 
191, 192], Given their large size, the evaluation of novel therapeutic approaches (drugs or 
devices) in pet dogs can answer important questions regarding relevant drug exposure that are 
often inadequately considered in mouse models. Additionally, the similarities between humans 
and dogs with respect to their size, tumor biology and anatomy provide an opportunity to 



engineer devices for limb sparing or prosthesis and optimize surgical interventions that are 
challenging to recreate in other animal model systems [51, 74, 193]. 

The ability to rapidly advance therapeutics for rare human malignancies such as pediatric 
OS is limited by the low incidence of these diseases in humans [194], In contrast, OS is at least 
ten times more prevalent in dogs, providing a significantly larger patient population in which to 
evaluate new treatment strategies. Importantly, pet dogs represent a large population size and 
their owners are highly motivated to seek out new treatment options for their pets, which provide 
a unique opportunity to sufficiently power clinical trials, including the assessment of new drugs 
[8, 51]. Serial tumor biopsies and repeated collection of body fluids (serum, whole blood, urine) 
from dogs before, during, and after exposure to an investigational agent allows for evaluation of 
clinical and biological endpoints (e.g., pharmacokinetics and pharmacodynamics) that can be 
linked to drug exposure, surrogate imaging or circulating biomarkers, and therapeutic response in 
ways that are often difficult or unacceptable in human trials [195, 196], Pet owners are often 
willing to permit autopsy, which is crucial not only for assessment of tumor control, but also 
treatment-related toxicity. Lastly, the naturally shorter lifespan of dogs compared to humans, 
coupled with short survival times achieved with current treatments for canine cancers accelerates 
the pace at which clinical trials in dogs can be conducted and allows for more rigorous 
evaluation before translation into new human trials [195]. The compressed course of cancer 
progression seen in dogs allows timely assessment of novel cancer therapies and permits less 
costly outcome determinations, such as time to metastasis, local recurrence and survival. 

Because no established gold standards exist for the management of cancer in dogs, the 
evaluation of novel therapeutics is possible in less advanced or less heavily pretreated pet 
populations, compared to human cancer patients participating in early-phase human trials. Unlike 
human clinical trials, investigational studies in dogs are not constrained by traditional Phase I, 
Phase II, and Phase III trial designs allowing new forms of treatment (especially investigational 
single-agent trials) to be offered to pet dogs prior to conventional therapies or during the period 
of minimal residual disease. This provides the unique opportunity to evaluate single-agent 
activity or combination therapies earlier in the drug development process. 

Organized cooperative efforts are now in place in the United States to facilitate the 
inclusion of pet dogs with naturally occurring cancer into the development path of new cancer 
drugs. The Veterinary Cancer Society and the Veterinary Cooperative Oncology Group have led 



efforts to encourage multicenter collaborative veterinary oncology studies and to enhance case 
accrual and facilitate clinical trials. Additionally, the Comparative Oncology Program (COP) of 
the National Cancer Institute (NCI) at the National Institutes of Health has established the 
Comparative Oncology Trials Consortium (COTC) to conduct rigorously controlled and focused 
preclinical trials of new cancer drugs intended to inform the design of human studies [8,51, 

197]. The COTC functions to facilitate the design and execution of clinical trials in dogs in 
collaboration with extramural academic comparative oncology centers, the pharmaceutical 
industry, and nongovernmental groups interested in cancer drug development. A multi- 
institutional Pharmacodynamics Core was established through the COTC to provide an 
infrastructure within the veterinary research community to support the development, validation, 
and assessment pharmacokinetic and pharmacodynamics end-points within COTC trials [198]. 
Finally, the Canine Comparative Oncology and Genomics Consortium (CCOGC) has established 
a national canine cancer biospecimen repository as a resource to facilitate comparative genomics 
and the identification of valid tumor targets in canine cancers to aid in preclinical drug 
development [51]. This national infrastructure is now able to i) support and facilitate the 
implementation and conduct of multi-institutional studies, ii) more directly engage the veterinary 
oncology community, and iii) respond to the needs of the pharmaceutical community to better 
inform the drug development path of new cancer drugs. 

8 . 4 . 4.2 

Conduct of preclinical and translational studies in pet dogs with cancer 

The value of including pet dogs with cancer into preclinical studies intended to support the 
development of human cancer treatment strategies is significant. Studies utilizing dogs with 
cancer can inform many different aspects of the human preclinical drug development process 
(see Fig. 8.4.5). Importantly, these studies are facilitated by the ability to use the “species in 
kind” approach. That is, these studies can be conducted with the knowledge of the drug toxicities 
and pharmacokinetics derived from studies performed in healthy animals (laboratory dogs) of the 
same species. Preclinical studies in tumor-bearing dogs can be used to address questions of 
toxicity and pharmacokinetics in patients with cancer, treatment schedule, pharmacodynamics 
assessment and endpoints, efficacy, and others [91, 92, 197]. 



Clinical trials performed in companion animals with the goal of informing human 
oncology are considered to be preclinical studies with respect to human drug development [199]. 
The term clinical trial has a broader definition in veterinary medicine and is used to describe any 
clinical research study that enrolls client-owned companion animals. Recent efforts by key 
opinion leaders from the pharmaceutical and biotechnology community, academia, and 
regulatory and federal agencies have proposed guidelines for the conduct and oversight of 
preclinical translational studies that include pet dogs with cancer intended to support the 
development of human cancer drugs or treatment delivery devices [197, 200, 201]. Non-human 
clinical studies that include pet dogs with cancer are designed and implemented with the humane 
care of the pet animal cancer patient as a primary consideration, with the informed permission of 
the pet owner, and under the guidance of an accredited institutional animal care and use 
committee (IACUC) and Clinical Trials Review Board (CTRB) [197, 200, 202]. Oversight 
provided by IACUC and CTRB ensures patient safety through protocol review before study 
initiation and monitoring patient outcome and protocol compliance during the course of a clinical 
trial. Additionally, the inclusion of a data safety monitoring board (DSMB) in veterinary clinical 
trials provides an additional layer of patient protection and assurance regarding appropriate study 
conduct and termination. Similar to a DSMB responsible for the oversight of a human clinical 
trial, the DSMB in a comparative oncology study has a role in reviewing adverse events that 
occur during the clinical trial and an interim analysis of trial efficacy [203]. In order to provide 
minimum standards for their conduct in dogs, studies are conducted using the guidelines outlined 
in the spirit of Good Clinical Practice (GCP). With the goal of providing assurance of credible 
study results, the GCP guidelines were developed by the International Conference on 
Harmonization to i) protect the welfare and rights of human clinical trial participants, and ii) 
define standards for trial design and conduct, data capture and analysis, and auditing and 
reporting of clinical trials [204]. 

Clinical translation in dogs is not constrained by the historic conventions of phase I, II 
and III studies. However, veterinary clinical studies contain many of the same components of 
human trials. For instance, informed consent and the designation of specific inclusion and 
exclusion criteria are included in veterinary clinical trials to ensure enrollment of proper patient 
populations and validate the integrity of the clinical data obtained. Central to the conduct of 
successful veterinary clinical trials is appropriate trial design, including adequate statistical 



power to answer the specific questions that are necessary for moving product development 
forward. For example, assessment of biological endpoints addressing the mechanism of action or 
therapeutic index, identification and validation of biomarkers, and the correlation of these 
endpoints with imaging and pharmacokinetics [92, 205, 206]. Later studies may prioritize anti¬ 
tumor activity against measurable tumors or against minimal residual disease and should be 
flexible in design so as to efficiently respond to new data generated both within and outside of 
the study. The active pharmaceutical ingredients (API) considerations for non-human clinical 
studies does not require good manufacturing practice (GMP) certification, but agents prepared 
for these studies are prepared sterile, endotoxin free, and of high quality and purity. Given the 
scientific and translational intent of such studies, the use of GMP quality agents will likely 
become more important to study sponsors as an agent progresses to and beyond the point of 
investigational new drug (IND) filing [197]. 

Within the practice of human oncology, standardized response criteria for malignant 
tumors allow for more consistent and meaningful comparisons of treatment protocols and 
outcomes. The World Health Organization established standardized criteria for the recording of 
baseline data relating to the patient, the tumor, laboratory and radiologic findings, the reporting 
of treatment, grading of acute and subacute toxicity, reporting of response, recurrence and 
disease-free interval and reporting results of therapy for clinical trials in human oncology [207], 
Subsequent documents establishing response criteria for malignant lymphoma and the human 
Response Evaluation Criteria in Solid Tumors (RECIST) have been published and are 
periodically updated, providing standardized criteria to assess therapeutic response that are now 
considered standard practice in human clinical trials [208, 209]. Given the increased number of 
prospective clinical trials performed in veterinary oncology and the role of spontaneous canine 
tumor models in the human cancer drug development process, standardized adverse event and 
grading criteria and guidelines to evaluate tumor response in dogs with peripheral nodal 
lymphoma and solid tumors have been established by the Veterinary Comparative Oncology 
Group (VCOG) [208, 210, 211], These guidelines have been modeled after similar human 
response evaluation criteria in an effort to provide consistency and accuracy of reporting patient 
response. Additionally, tools to assess health-related quality-of-life in dogs with cancer have 
been established to assess owner-perceived changes in dogs undergoing therapy, and to evaluate 



the impact agents that are administered for prolonged dosing periods have on overall quality of 
life [212], 

Guidelines for regulatory oversight and standards for reporting data from comparative 
oncology trials that include pet dogs intended to support the development of human drugs are not 
well-defined, but specific guidelines for the timing and nature of such reports is currently under 
discussion. Studies in tumor-bearing dogs are typically conducted at two points in time in the life 
of a new human cancer treatment, either before an IND is filed (i.e. pre-IND) or after an IND is 
filed (i.e. post-IND); however, the implementation of the study (including protocol development 
and design), IACUC, CTRB and data safety management oversight, and trial conduct are similar 
in both development settings. The filing of an Investigative New Animal Drug (INAD) through 
the FDA-Center for Veterinary Medicine (CVM) is necessary for the regulatory development of 
a new drug for use in the animal health market, but investigational agents intended for human 
use alone or for those agents not yet submitted for IND status may be treated similarly to other 
preclinical studies in traditional model species [197], 

Trial implementation guidance, including both IACUC and DSMB oversight, adequately 
addresses the questions of risk of the proposed studies to pet animals required for INAD filing 
and typically includes details on the API beyond those generally required by an INAD. Whether 
or not an INAD is filed for tumor-bearing dog studies conducted in the pre-IND setting, it is 
recommended that a full report and associated preliminary data documenting expected and 
unexpected adverse events should be maintained as part of the legacy of the agent under 
development. Unexpected adverse events that occur should be reviewed and addressed during 
the study by the sponsor, investigators, and DSMB. If the agent in question progresses through 
development, a final study report and associated data generated in tumor-bearing dog studies 
should be included in an IND application package. For new human cancer agents that are post- 
IND, data from a tumor-bearing dog study would become part of a reporting package for the 
agent being evaluated and regulations regarding adverse event reporting are provided by 
Investigational New Drug Application section 312.32 IND Safety Report [197]. Based on this 
criteria, any event that occurs in a study of a new human cancer agent conducted in tumor¬ 
bearing dogs that is either not serious or is expected, based on the protocol and informed consent, 
does not require expedited reporting. If an unexpected adverse event does occur in the setting of 
preclinical evaluation of a new drug in pet dogs, the inclusion of stopping rules that allow 



expansion of treatment cohorts may help determine if an unexpected event is reproducible. If this 
is the case, modifications to the study to alter eligibility and exclusion criteria or additions to 
monitoring strategies may help investigators better understand these adverse events [197]. 


8 . 4 . 4.3 

Examples of successful preclinical investigations in pet dogs with cancer 

Dogs have historically been useful, informative models in the preclinical development and 
discovery of novel cancer therapeutic strategies. Early studies evaluating the biological features, 
chemotherapeutic and surgical treatment of spontaneous tumors in dogs have provided guidance 
for conducting similar investigations in human patients and subsequently informed the 
management of human cancers [65, 74, 213], Studies in dogs are uniquely positioned to evaluate 
the efficacy and feasibility of novel drugs and drug delivery devices and can inform the go/no-go 
‘decision gate’ in clinical drug development. 

Examples of such efforts include comparative studies conducted in pet dogs with OS to 
evaluate the safety, efficacy, and feasibility of novel inhalation therapies in the management of 
macroscopic pulmonary OS metastasis [214, 215]. Similarities in the respiratory anatomy and 
tropism of OS metastasis to the lungs in humans and dogs provided rationale for early canine 
trials of inhaled cytokine and cytotoxic chemotherapy trials. Inhalation cytotoxic chemotherapy 
trials in dogs demonstrated that aerosolized therapies were well tolerated with no dose-limiting 
hematologic or biochemical toxicity and minimal histologic lung pathology. Furthermore, these 
studies supported the proposed mechanism of antitumor activity associated with this therapy in 
dogs and contributed to the clinical development of inhalation approaches in humans [60, 150, 
214-217]. Proof of concept studies evaluating inhaled cytokine immunotherapy in dogs 
established a safety and efficacy profile for inhaled liposomal IL-2 therapy, demonstrated 
evidence of local immunomodulatory effects, and provided support for subsequent early-phase 
trials of this novel treatment approach in humans with pulmonary metastasis [215, 216]. 

Comparative studies in dogs with cancer provide an opportunity to evaluate the feasibility 
and tolerability of surgical procedures or devices, interventional radiology techniques, and 
radiotherapy that are difficult to recreate with other animal model systems [51, 74]. Early studies 
in pet dogs with OS undergoing various limb-sparing surgical approaches provided meaningful 
data relevant to limb-sparing techniques, bone allograft antigenicity, implant loosening and 



fracture, and durable allograft healing in people [58, 218-220], Similarly, clinical studies 
assessing image-modulated radiation therapy (IMRT) and tomotherapy technology in pet dogs 
with cancer provided important information regarding device utility and accuracy prior to its 
widespread use in human patients [221, 222], More recent studies in dogs with nasal and bladder 
cancers treated with IMRT have explored the dosimetric impact of daily setup variations during 
treatment and dose escalation strategies; and they provided proof-of-principle that conformal 
avoidance radiotherapy can decrease the incidence of acute and late toxicity to surrounding 
normal tissues [57, 223-225], 

In contrast to rodent models of cancer, dog cancers are more akin to human cancers in 
terms of physiology and metabolism for most organ systems and drugs. Additionally, 
spontaneous canine cancers occur in the presence of an intact host immune system and possess 
natural tumor heterogeneity, stroma, and vasculature [51, 195], To this end, the inclusion of 
tumor-bearing dogs in high-resolution comparative imaging and cross-species validation studies 
may provide data better describing off-target tracer localization, biodistribution, and key 
complex intracellular processes such as hypoxia, DNA proliferation, and glucose metabolism 
[226-231], For example, multimodality functional imaging studies performed in dogs with 
spontaneous sarcomas and carcinomas evaluated the uptake of a novel hypoxia specific 
radiotracer [ 61,64 Cu]copper(II) diacetyl-di(A^-methylthiosemicarbazone) (Cu-ATSM) and 
(18)FDG in conjunction with paired pimonidazole hypoxia immunohistochemistry. This study 
demonstrated a strong positive correlation between the distribution of pimonidazole staining and 
uptake of (18)FDG and (64)Cu-ATSM, providing support for future clinical investigations 
evaluating the suitability of functional biotracers for in vivo imaging and radiotherapy target 
definition in solid tumors and their potential as radiotherapeutic agents [232], Subsequent 
preclinical studies investigating intratumoral uptake of (18)FDG, (18)FLT, and Cu-ATSM in 
canine sinonasal tumors provided meaningful data on histology-specific positron emission 
tomography (PET) correlations, and on the spatiotemporal stability of tracers during radiation 
therapy treatments (see Fig. 8.4.6) [233, 234], These findings may have a significant impact on 
clinical dose painting strategies used to define radiation therapy dose escalation. 

More recently, efforts to include dogs in the preclinical modeling of personalized 
medicine have evaluated the feasibility of conducting molecularly-guided analysis of tumors 
from dogs with naturally occurring cancers in a clinically relevant setting [235]. This proof-of- 



concept study demonstrated that the collection and turnaround of canine tumor samples, 
centralized pathology, molecular profiling and bioinformatics analysis matching gene expression 
to therapeutic options is achievable in a practical clinical window (<1 week). Furthermore, as 
observed in human personalized medicine trials, tumor gene expression signatures in dogs 
clustered by cancer type, whereas patient-to-patient heterogeneity resulted in drug predictions 
that were uniquely patient defined. This study serves as rationale for dogs with spontaneous 
cancers as a clinically relevant comparative model for optimizing the delivery of personalized 
medicine strategies and translating this to human patients. 

Naturally occurring cancers in pet dogs more accurately model the complex biology of 
the cancer microenvironment, including antitumor immunity and angiogenesis [103, 236-242]. 
Experimental mouse tumor models, including transplantable models, genetically engineered 
tumor models, and humanized mice have provided key mechanistic insights into host antitumor 
immune responses that have guided the development of novel immunotherapeutic strategies in 
human cancer patients [243, 244], Canine cancers, however, develop in the context of an intact 
immune system that recapitulates the considerable heterogeneity that exists in the human 
immune system. To this end, several immunotherapy trials in dogs have evaluated autologous 
vaccine strategies [245, 246], biological response modifiers [247-250], passive immunotherapy 
[251, 252], adoptive T-cell therapy [253] and have explored their feasibility and immunologic 
efficacy when used in combination with dose-intense chemotherapy [248, 249]. 

Early studies evaluating the efficacy of novel DNA-based vaccine strategies for 
melanoma in the setting of minimal residual disease in dogs were initially intended to inform 
human studies; however, these trials ultimately provided safety data required by the United 
States Department of Agriculture (USDA) for the approval of a veterinary biological vaccine 
(ONCEPT™ Canine Melanoma Vaccine, Merial), the first and only USDA-approved therapeutic 
vaccine for the treatment of cancer in either animals or humans [254-256], Initial clinical 
evaluation of the anticancer immune effects associated with the administration of liposome- 
encapsulated muramyl tripeptide-phosphatidylethanolamine (L-MTP-PE) conducted in dogs with 
OS in the setting of minimal residual disease demonstrated single-agent anticancer activity [257]. 
Subsequent randomized, placebo-controlled clinical trials of L-MTP-PE conducted in 
conjunction with standard-of-care chemotherapy in dogs [258] served as scientific rationale for 
phase III evaluation of L-MTP-PE in pediatric OS. Findings of a Children’s Oncology Group 



clinical trial were similar to those reported in the initial canine studies and demonstrated that the 
addition of L-MTP-PE to standard chemotherapy in pediatric OS significantly improved overall 
survival at 6 years [259, 260]. Based on these findings, L-MTP-PE (mifamurtide, Mepact®) was 
approved for the treatment of metastatic OS by the European Medical Association. This 
highlights the utility of pet dogs with spontaneously occurring cancers for the investigation of 
novel immunotherapeutic agents, and to define the activity of these therapies in the 
micrometastatic disease setting. 

Similarly, the development of anti-angiogenic or vascular targeted agents has been 
complex and with disappointing results from studies in tumor-bearing mice and human cancer 
patients [261], Since neither tumor nor tumor vasculature are present in healthy animals (e.g. 
purpose-bred research dogs), the integration of tumor-bearing dogs in trials using anti-angiogenic 
agents that are under development for humans can provide a strong preclinical basis for first-in¬ 
man studies in human cancer patients. Evaluation of the targeted delivery of TNFa with an 
adeno-associated virus phage (RGD-A-77VF) to tumor and tumor-associated vascular 
endothelium in pet dogs with spontaneous cancer helped define the safety, selective tumor- 
specific localization, and efficacy of RGD-A -TNF targeting to tumor vasculature [262], Serial 
tumor and control biopsies taken before and after the administration of RGD-A -TN established 
adeno-associated virus phage homing to disease-related vasculature and tumor-associated 
vascular expression of TNFa and correlated these findings with drug exposure and RECIST- 
based objective responses in several tumor types. Similarly, evaluation of anti-angiogenic 
thrombospondin-1 (TSP1) peptide mimetics (ABT-526, ABT-510) in pet dogs with cancer 
conducted in parallel with human clinical trials provided guidance on the single-agent drug 
activity, drug levels and prolonged exposure duration required for anti-tumor activity [263]. This 
emphasized the importance of evaluating these agents in novel trial designs and in combination 
with other anticancer agents. Subsequent studies have explored anti-tumor efficacy of alternative 
drug formulations of TSP1 peptides in dogs with soft tissue sarcomas, and the cooperative 
activity between cytotoxic chemotherapy and TSP1 anti-angiogenic therapy in dogs with 
lymphoma [264, 265]. 

The integration of studies that include pet dogs with cancer into the development path of 
new cancer drugs is becoming more common and is expected to increase as part of innovative 
drug development. The addition of comparative oncology studies in the preclinical setting has 



the potential to improve the development path of new cancer drugs by identifying and removing 
drugs earlier that are less likely to succeed in human phase I trials [195, 197]. Comparative 
studies in dogs performed during or after phase I studies may provide meaningful data on 
pharmacokinetic/pharmacodynamics end-points, classifying responding patient subsets, and 
identifying optimal drug combinations that may eliminate inactive drugs prior to advancing to 
human phase II clinical trials, or optimize the design of those trials. Additionally, such studies 
conducted in the adjuvant or minimal residual disease setting may prioritize those agents most 
promising for phase III development [197, 201]. Collectively, the elimination of inferior drugs 
earlier in the drug development pathway will reduce the number of drugs entering later phases of 
drug development and increase the overall success rate of phase III trials. This will substantially 
decrease the risks and costs of drug development. 

To this end, studies in dogs with cancer can aid in establishing relationships between a 
cancer target, its modulation with a novel small-molecule inhibitor, and clinical benefit. 
Correlative studies that would be challenging to conduct in humans, including multiple biopsy 
and other biospecimen collection time points, are readily feasible in pet dog studies. A 
prospective dose escalation study of rapamycin in dogs with OS was performed to define optimal 
dosing schedules, assess biomarkers, and provide the rationale for the use of rapamycin, or 
potentially other mTOR inhibitors, in OS [198], Pre- and post-treatment biopsies and peripheral 
blood mononuclear cells (PBMC) were collected and 48-hour whole blood sampling was 
performed to establish a pharmacokinetically-relevant and pharmacodynamically active dose of 
rapamycin in dogs with OS. Data from this phase I trial demonstrated that biologically effective 
concentrations of rapamycin were safely obtainable in dogs and provided evidence of target 
modulation in tumor tissues and PBMCs. This study highlights the advantage of integrating the 
comparative approach in the development path of new cancer drugs. Importantly, such studies 
help to establish critical pharmacokinetic and pharmacodynamic relationships so that drugs with 
an unfavorable therapeutic index or inferior target modulation attributes may be identified and 
removed from development earlier in the process [195, 196]. That thus improves the 
identification of the agents most likely to succeed in human clinical trials. 

This comparative approach also can have a significant impact on informing the 
development and conduct of later-stage studies in humans. Heat shock protein 90 (HSP90), a 
molecular chaperone that promotes the conformational maturation and stabilization of a wide 



variety of client proteins, is a promising target for therapeutic intervention in cancer. Ganetespib, 
a novel small molecule inh i bitor of HSP90, and its water soluble prodrug STA-1474 
demonstrated activity against canine OS and malignant mast cell lines in vitro and induced tumor 
regression, apoptosis and downregulation of key targets including MET and AKT in OS 
xenografts [266, 267], Based on these findings, a phase I study of STA-1474 was performed in 
dogs with spontaneous tumors [205]. This clinical trial demonstrated biological activity in 
various canine cancers (including dogs with metastatic OS and malignant mast cell disease), 
established safety and toxicity profiles, and provided important information regarding expected 
gastrointestinal adverse events that were subsequently observed in human clinical trials with 
ganetespib. Pharmacodynamic endpoints provided evidence of target modulation in PBMCs and 
established this as a reliable biomarker of drug activity. Additionally, pharmacokinetic analysis 
in this study provided information on drug levels and exposure duration that subsequently 
established dosing schemes that were unanticipated prior to the start of the clinical trial. These 
data laid the groundwork for the current clinical evaluation of ganetespib in humans. 

Additional examples exist that highlight the use of dog models to inform preclinical drug 
development, including agents targeting pathways associated with invasion and metastasis [268]. 
Large animal spontaneous cancer models compliment the use of both existing rodent models and 
human clinical trials. Their inclusion in preclinical and translational studies will facilitate the 
rapid intermediate evaluation of agents prior to or after early human trials. This will result in a 
more informed and optimal cancer drug development pathway. 


8.4.5 

Necessary developments for realizing the potential of canine models 
8.4.5.1 

Epidemiology, longitudinal cohorts, tissue repositories and integrative genomics 

Great efforts have been made to put canine models on equal footing with other investigational 
animal models. Much of that work has focused on creation of genetic resources, validation of 
human relevance and development of resources for clinical trials [9, 195, 196], Here we note 
several areas that are likely to have a high impact on biomedical canine models [7]. First, the gap 
in epidemiology of common diseases in dogs should be addressed. Specifically, longitudinal data 
in veterinary registries is necessary for both epidemiological and translational research [195, 196, 



269]. There are many examples of early efforts in this [270-273, see p. 55 of 274], In terms of 
utility for understanding the common diseases of complex genetics, such resources could stress 
this aspect in dogs and cats. The information would be most useful if it was, at least in part, 
associated with biological samples. That could lead to powerful databases for integrative analysis 
of clinical, environmental and molecular (genetic, epigenetic and other ‘omics) data in select 
breeds. One major limitation to these goals is the lack of standardized veterinary records. The 
creation of national or international standards, and ideally a common electronic veterinary 
medical record system, would be major advances. While there are individual academic and 
national veterinary tissue banks (http:// .ccogc.net/) , these could be developed further and 
combined with electronic veterinary health records. It is generally challenging to acquire 
complete dog pedigree data, but this can be invaluable in genetic studies. An ideal longitudinal 
cohort should make it possible to gather the following data in electronic form: clinical, 
environmental, pedigree, genetic and molecular phenotypes. 


8 . 4 . 5.2 

Improved genome annotation and development of key research areas 

The quality and completeness of the canine genome assembly is among the highest of any 
mammalian genome sequenced to date [23, 275]. However, compared to the human genome 
annotation, the dog’s is in its infancy. Many dog genes are not annotated, but can only be found 
through mapping of genes from other species. This often requires user verification to determine 
gene identities, gene structure, etc.; and in many cases there is little or no information on mRNA 
expression and splicing in dogs. Some of the necessary improvements could be done 
computationally, but that would require significant manual effort and would still require 
experimental confirmation. The most effective approach may thus be to conduct 
multidimensional genomic surveys as were done for SNPs (to create the most recent SNP 
platform [276]) or breed-specific CNV maps [40, 41]. The use of canine models could be thus be 
accelerated by expanding genomic annotation for individual breeds of high interest. Among the 
data that would be most useful are haplotype maps and whole genome sequences. Other high- 
value information is tissue-specific genomic data, beginning with RNA expression (underway; 
[275]) and DNA methylation (and in the future adding other genomic dimensions as is being 
done in humans [277]). 



Because dogs are not commonly used as experimental models in the way that, say, 
zebrafish, frogs, chicken and mice are, their molecular genetics/biology are not well developed. 
Going forward, it will be important to strengthen these aspects in order to capitalize on the rich 
genetic findings. For instance, the alpha and beta globin gene clusters that encode the subunits of 
hemoglobin are among the best-characterized gene-cluster regions across vertebrates and 
mammals. However, those genes are not annotated in the most recent dog genome assembly, and 
they cannot be readily discerned by visualizing the human genes that map by homology. This 
means that investigators interested in comparative blood studies in dogs either have to ignore 
hemoglobin gene regulation and protein sequences (or study it blindly without that information) 
or characterize the gene structure, protein sequences, and gene cluster regulation before 
undertaking those studies. Some of the most important areas that can dramatically raise the value 
of dog models are immunity, metagenomics, metabolism, thyroid biology, obesity, aging and 
stem cell biology. Of these, the one where dogs may have the most promise is aging. Because of 
the dramatic range in longevity across breeds, dogs may represent the best animal in which to 
study aging [278]. Moreover, canine longevity correlates inversely with size across dog breeds 
and has clear associations with reproductive traits and mortality/cause of death (including 
infection and cancer) [170, 279-282], Dogs may thus offer the best opportunity to dissect apart the 
genetics of growth and aging, and to elucidate their relationships to longevity and 
mortality/cause of death (including from cancer). With regards to cancer, it will be important to 
expand research in areas that include metastasis, tumor microenvironment, and resistance to 
chemotherapy. Lastly, there are several technological aspects of canine models that could be 
further developed to enable their common use in translational veterinary studies. These include 
gene therapy, antisense nucleic acids (which can be used to knockdown gene expression, for 
exon-skipping, or for blocking specific long non-coding RNAs to, in effect, increase gene 
expression), genome editing, and stem cell therapies. 


8 . 4 . 5.3 

Opportunities for understanding the complete biology of spontaneous cancers 

Comparative oncology approaches open unique and unparalleled opportunities to potentially 
identify targets for therapeutic intervention in canine and human cancer. This effort will require 
increased rates of canine tumor archiving and genetic-profiling [275, 283]. Many examples 



demonstrate how canine genetics can reveal cancer subtypes associated with clinical outcomes in 
dog and human patients [52, 79-81, 106]. To translate these findings more globally into the 
cancer drug development pathway, the establishment and molecular characterization of canine 
tumor cell lines from a variety of tumor types is needed to validate molecular target expression 
and provide preliminary assessment of the biological effects of therapeutic agents in vitro. Data 
validating the expression of molecular targets in primary canine tumor samples will provide 
rationale for potential therapeutic benefit and demonstration of biological activity of a 
therapeutic agent against tumor cell lines will support the subsequent in vivo evaluation of novel 
cancer agents in dogs with spontaneous cancers. 

Spontaneous cancers in dogs model the biological complexity of human cancers, 
including the complex interaction of the immune system with tumor cells and the development of 
chemotherapeutic resistance and metastasis that is integral to outcomes in both humans and dogs. 
For example, naturally occurring OS in dogs serves as a useful large animal model to study the 
biology of metastatic progression and identify potential targets that are linked to the pathogenesis 
of micrometastatic progression [72, 201]. Tissues from metastatic lesions and matched primary 
tumors from human patients are not widely available at this time; however, expanding existing 
biospecimen efforts to collect clinically annotated tissues from dogs with OS throughout the 
course of disease presentation and progression may be feasible given the high incidence and 
aggressive metastatic behavior of this disease in dogs. This effort may provide a broader 
understanding of the development of metastasis in OS through candidate drug target expression 
profiles and their dynamics during metastatic progression. Similarly, the identification of 
populations of cell that have the ability to suppress antitumor immune responses have been 
challenging to dissect in animal models; however, components of the tumor microenvironment, 
including regulatory T cells (Tregs) and myeloid-derived suppressor cells have been 
characterized in healthy and cancer-bearing dogs [239, 284, 285]. Furthermore, the clinical 
significance of various components of the tumor microenvironment has been recognized in 
several cancer types in dogs, providing additional support for the notion that systemic antitumor 
activity plays an important role in the pathogenesis of naturally occurring cancers in dogs [103, 
242, 286]. The development of spontaneous cancers in dogs provides the opportunity to study 
tumor progression in the context of an intact immune system where tumor, host, and tumor 
microenvironment are interacting intimately with one another. To this end, studies evaluating 



metronomic chemotherapy strategies in dogs to halt or slow tumor progression may help define 
their proposed mechanisms of action, including anti-angiogenic or antitumor immune responses 
and identify pharmacodynamics markers of effective therapeutic exposure and target modulation 
in tumor and surrogate tissues [240, 241], 


8 . 4 . 5.4 

Development of high-impact programs in preclinical cancer studies 

Historically, the widespread use and integration of comparative studies was hindered by the lack 
of infrastructure to coordinate the necessary parties: veterinary and human oncology, drug 
industry and centralized biospecimen repositories. The organization of collective groups 
(including the COTC, CCOGC, and Veterinary Cooperative Oncology Group) now provides the 
infrastructure and resources to support studies of pet animals with cancer that are increasingly 
“integrated” into the developmental path of new cancer treatments. For these studies to be most 
useful, they must be designed so as to ask important and well-defined biological questions that 
cannot be easily or fully answered by conventional preclinical models or in early human clinical 
trials [197]. The inclusion of a multi-disciplinary collaborative team (including basic scientists, 
veterinarians, and physicians) to rigorously review such investigations is necessary to ensure that 
questions are prioritized and efficiently answered. To optimize the translation of findings in the 
canine model, preclinical studies conducted by the COTC involve a multicenter collaborative 
network of veterinary teaching hospitals to ensure that they are adequately powered with sample 
sizes sufficient to detect drug effects with the endpoint parameters used. Incorporation of 
standardized response criteria (RECIST) and adverse event reporting in preclinical trials has 
facilitated the comparison of current and future treatment protocols used for companion dogs 
with cancer. Novel electronic reporting systems designed to provide real-time data capture are 
being successfully used in canine studies and it is anticipated that this method of data reporting 
may be incorporated into larger, multi-institute investigations to encourage discussion among 
study investigators during all phases of the investigation [51]. Importantly, translational studies 
are optimally designed and developed to include study endpoints that are specifically aligned 
with the design and development of studies in other preclinical species and human studies, 
thereby expanding the totality of information available for an agent and complementing data 
from other preclinical models that will optimize the drug development pathway. 



As part of the broader field of comparative oncology, translational drug development 
studies in dogs with cancer have classically evaluated pharmacokinetic and pharmacodynamic 
endpoints to define dose and schedule for therapeutic agents. Preclinical studies in dogs 
evaluating novel small molecule inhibitors have established relationships between drug exposure 
and target modulation; however, changes in target expression within peripheral blood 
mononuclear cells following drug treatment have aided in the discovery of surrogate biomarkers 
that may predict biological response [198, 205]. To complement this effort, the expansion and 
integration of molecular imaging techniques in tumor bearing dogs may provide critical new data 
regarding surrogate imaging endpoints during the process of drug discovery and development. 
Comparative imaging studies in dogs with cancer have established proof-of-concept for 
radiolabeled tumor-targeted peptides, and provided validation data for novel PET 
radiopharmaceutical agents, and have demonstrated the feasibility of serial PET/CT imaging to 
monitor response to cytotoxic chemotherapy or small molecule inhibitors [226, 287-290]. 
Diagnostic imaging studies in dogs have largely focused on improved methods for early 
determination of response to therapy or prognostication; however, alternative imaging modalities 
such as optical imaging of tumor tissue beds to detect minimal residual disease at the time of 
surgery have been explored in dogs with solid tumors to inform their development for humans 
[291, 292]. The incorporation of dogs in such comparative imaging studies designed to validate 
and optimize novel imaging techniques will be critical to imaging agent development and in the 
drug development pathway. 

Lastly, the opportunity to conduct studies in dogs in the setting of microscopic or 
micrometastatic disease is a unique and powerful advantage of the canine model [51, 196, 258, 
293, 294], The advantage of including pet dogs with cancer in the development of drugs that 
target metastatic progression in OS has been recognized and more recently, consensus guidelines 
to establish a preclinical translational pathway to value and prioritize potential therapeutic agents 
uniquely targeting the metastatic phenotype have been proposed [201]. A major effort of 
comparative oncology has been to identify shared targets in human and canine OS. Substantial 
progress has been made through experimental and preclinical investigations to identify 
dysregulated intracellular signaling pathways likely contributing to OS metastasis. For example, 
data from in vitro and murine xenograft studies have identified signal transducer and activator of 
transcription 3 (STAT3) and the membrane-cytoskeletal linker ezrin as relevant targets for 



therapeutic intervention in human and canine OS and support the current development of small 
molecule inhibitors of STAT3 and ezrin for the treatment of OS [83, 165, 295-297], The 
inclusion of dogs with spontaneously occurring OS in the development and testing of novel 
therapeutics that target STAT3 and ezrin biology in the micrometastatic disease setting will 
likely provide important new information regarding efficacy and optimization of different doses 
and schedules with direct relevance to future testing in people. Importantly, while druggable 
targets linked to metastatic progression of cancer have been identified, many of these targets and 
associated processes seem to influence the progression of metastatic cells from microscopic 
disease to that of grossly detectable lesions. Therefore, it is unlikely that many of these agents 
will have a measurable effect on established and grossly detectable lesions at either the primary 
or metastatic location [201]. As a subset of these drugs that target metastatic may have no 
activity in the setting of measurable disease, preclinical trials conducted in dogs in the 
micrometastatic disease setting may provide compelling data defining their activity and support 
their advancement in the drug development pathway. 


8 . 4.6 

Key challenges and recommendations for using canine models 

8 . 4 . 6.1 

Challenges of population structure in dog models 

The many advantages of the canine model have been discussed in this chapter, but one should 
also be aware of its potential challenges. Most notably, the very thing that makes dogs so 
genetically useful - isolated breeding - also presents limitations and complications. Breeding of 
domesticated dogs is not natural or random. Rather, dogs are bred for diverse types of selection 
by humans. This must be accounted for in dog research, especially in genetics. For instance, 
statistical power is affected by the extent of the difference that the design expects to detect. 
Keeping other variables that influence statistical power the same, a larger effect-difference is 
associated with increased power (i.e., that translates to an increased chance of a successful true 
positive detection). The larger effect sizes often observed for dog traits (compared to humans) 
are powerful, but this is in exchange for complex structural patterns and relatedness that can 
confound or mask the detection of that effect. Equally important, dogs can be fixed for the gene 
alleles that have the greatest effect on the overall disease/trait of interest; and this information 



would be missed by genetic mapping. Population structure can generate false positives and be 
difficult to separate from the traits of interest. It is also possible that some traits are inextricably 
linked to population structure. 

These issues have different implications for single-breed and cross-breed studies. In 
single breed studies, one must measure the effects of population structure on the GWA analysis 
and should identify the portion of the genome that was not sampled due to high levels of 
homozygosity in that breed. To address the latter with conventional methods (mainly without 
breeding studies), one must move laterally, first, to related breeds (likely to share disease- 
associated variants) and then to increasingly unrelated breeds. To address these issues across 
breeds, one must be careful to understand the difference between stereotypes associated with 
most or all members of a breed [35, 276, 298] and phenotypes that clearly segregate within 
breeds [42, 299-301]. If there is a mixture of breeds studied in a single analysis there should be 
sufficient breed variation while avoiding over-representation of any breed groups (unless they 
were evenly distributed across case-control groups or QTL phenotypes). That would serve to 
avoid strong population structure, but its effects should still be measured. If one had sufficient 
power, such analyses could be done in multiple GWA sets (each would have its own 
recognizable population structure and be analyzed separately). In the next section we propose 
additional ways to mitigate these challenges and suggest how investigators can use dog 
populations to avoid biases that are common in human biomedical research. 


8A6.2 

Recommendations for optimal results in canine preclinical research 

While the previous section suggests how dog breeding practices can introduce a major type of 
bias - population structure - we argue that dogs offer ways of circumventing biases that can be 
difficult to avoid in humans. First, human studies are extremely susceptible to very strong 
socioeconomic impacts [302], And second, the availability of hundreds of popular dog breeds 
that exist in a clinical context makes it possible to avoid the bias present in single human ethnic 
groups, single rodent strains and single dog breeds. This advantage can be appreciated by quickly 
reviewing the ways that bias is generally addressed [303]. For experimental studies, one can 
begin by randomizing assignment to comparison groups at baseline. In observational studies, one 
can begin by minimizing or stratifying heterogeneity to reduce differences between groups. And 



for both of those study designs, one can then i) measure and report the groups’ baseline data, and 
ii) if groups are unequal, describe the extent and direction of bias and discuss the effects of those 
on the interpretation of the study. For example, the issue of reducing differences between groups 
in observational studies is illustrated by the study of OS risk in three separate breed-specific 
GWAS’s [42], 

In the Introduction to this book, we discussed the causes of irreproducibility in preclinical 
research. Among the ways to prevent the most common problems are proper experimental (e.g., 
blinding and randomization) and analytical design, good record keeping, and careful reporting. 
Below we list key recommendations to increase the reproducibility of preclinical canine studies 
and to optimize their translation to human cancer patients: 

• Improved genome annotation and development of breed-specific catalogs of all genetic 
variation for select breeds 

• Improved genomic annotation of banked canine tumor biospecimens to aid in cross-species 
genomic analysis 

• Establishment of in vitro canine tumor cell lines to evaluate biological activity and provide 
support for in vivo evaluation of novel therapeutics in dogs with spontaneous cancers 

• Increased knowledge of key biology (e.g., immunity and aging) 

• Development of centers of excellence for canine and feline common-disease epidemiology 

• Development of centers of excellence for targeted biological therapies (e.g., antisense nucleic 
acids, siRNA, stem cells, gene therapy and genomic editing) 

• Prospective, adequately powered studies with sample sizes sufficient to detect drug effects with 
the outcome parameters used 

• Establishment and validation of primary and secondary endpoints in preclinical trials, 
including, but not limited to, determination of pharmacokinetic and pharmacodynamics 
relationships, surrogate biomarker identification, and comparative imaging endpoints 

• Standardized data reporting and inclusion of electronic data capture methods; standardized 
methods to evaluate response criteria and health-related quality-of-life assessment 

• Inclusion of dogs with spontaneous cancers in preclinical trials conducted in the setting of 
microscopic or minimal residual disease to inform the development of drugs targeting 
metastatic progression 

• Increased investigation of clinical imaging methods 



8.4.7 


Conclusions 

Going forward, the greatest contributions of canine models are likely to be in two areas. The first 
area is genetics - making use of their tractability for genetic mapping of complex disease risk. 
That will in turn lead to an unprecedented understanding of gene-gene and gene-environment 
effects. Predictive risk models will be built and tested in prospective studies. The second area of 
great promise is the full spectrum of translational medicine. This will take advantage of the large 
population of dogs and the established setting of veterinary health care. Knowledge of germline- 
risk genetics will lead to new-targeted approaches in both preventive and therapeutic medicine. 
For many diseases shared by humans and dogs, drugs under development for humans may be 
tested in veterinary clinical trials (even before they are approved by the FDA for human trials). 
Canine genomics and translation are converging with synergistic effects [10]. Cancer is a 
particularly attractive area for canine translational genomics because many targeted pathways 
can be assayed in tumors and that information can be used for drug selection. We propose that 
the strengths of dog models result in dramatically increased power and, potentially, reduced bias. 
Canine models thus represent a unique resource for significantly improving the reproducibility of 
human preclinical research. 



Figure Legends 



Figure 8.4.1: Popular Breeds and the Percent of Deaths from Cancer. On the Y-axis, breeds are ordered 
according to percent of deaths due to cancer [data from 63], Next to breed name on X-axis in parenthesis is the 
popularity of breed rank-ordered for 2013 according to the AKC (https://www.akc.org/reg/dogreg stats.cfm) . 
All pictures used from (www.akc.org/breeds/) . 

Figure 8.4.2: Estimated New Cases and Deaths for Common Cancers in the USA. According to Centers for 
Disease Control (CDC), cancer is narrowly second only to heart disease for the leading causes of death. In 
2011, heart disease claimed 596,577 lives, while cancer took only slightly less at 576,691 lives 
(http://www.cdc.gov/nchs/fastats/leading-causes-of-death.htm) . Coupled with loss of life is also the fininacial 
burdan. In 2010, cancer prevelance cost have been estimated between 124.5-216.6 Billion USD 
[http://www.cancer.org/cancer/cancerbasics/economic-impact-of-cancer; 304], This suggests that landmark 
revolutions are needed in treatment and care of cancer patients. No animal model completely recapiltureles the 
humans cancers perfectly, but the naturally occuring cancers in dogs more closely resemble the disease then do 
other animal models. Data for figure taken from “How Common is Cancer”, 
http:// seer, cancer. go v/statfacts/html/all .html . 

Figure 8.4.3: Number of Publications Related to Dogs and Cancer. We performed a search using the 
PubMed database (http://www.ncbi.nlm.nih.gov/pubmed) for publications related to dogs and cancer. We used 
the following search terms: “dog OR dogs OR canine OR dogs AND cancer”. Years were grouped and average 
publications for years calculated. From this, we calculated the annual growth rate and total percent change 
(Percent Growth Rate = Percent Change / Number of Years; http://www.miniwebtool.com/percent-growth- 
ratecalculator/?present_value=391&future_value =773&num=12). 

Figure 8.4.4: Germinal Centre of DLBCL. Antigen-activated B-cells differentiate into centroblasts that 
undergo clonal expansion in the dark zone of the germinal centre. During proliferation, the process of somatic 
hypermutation introduces base-pair changes that can lead to changes in the amino-acid sequence. Centroblasts 
then differentiate into centrocytes and move to the light zone, where the modified antigen receptor, with help 
from other immune cells, is selected for improved binding to the immunizing antigen. Newly generated 
centrocytes that produce an unfavorable antibody are removed. Cycling of centroblasts and centrocytes between 
dark and light zones appears to be mediated by a chemokine gradient. Antigen-selected centrocytes eventually 
differentiate into memory B cells or plasma cells. Centroblasts with genetic alterations that do not undergo 
apoptosis as expected can become GBC DLBCL. Likewise, plasma cells can become ABC DLBCL. Not 
Shown: Thymic cell which leads to PMBL. Listed are several known malignant transformations, adapted from 
[117,305,306], 

Figure 8.4.5. Integration of pet dogs with cancer into translational drug development studies. Canine 
cancer models compliment the use of both conventional preclinical models (mouse, research-bred dog and non¬ 
human primate) and human clinical trials and their inclusion in preclinical and translational studies will 
facilitate the rapid intermediate evaluation of agents prior to or after early human trials. Translational drug 
development studies in dogs may answer important questions about a new drug candidate such as toxicity, 
biological activity, and establish pharmacokinetic pharmacodynamic relationships for an agent before it enters 
human studies. Importantly, the comparative approach may answer questions that emerge in early phase human 
trials such as optimized dosing schedules, combination therapies, and the establishment of surrogate biomarkers 
or molecular imaging endpoints that will inform the evaluation of these agents as they move into later stages of 
development. Importantly, the totality of information generated from this comparative and integrative approach 
will likely reduce the late attrition rate of new cancer therapeutics and contribute to the identification of agents 
most likely to succeed in human clinical trials. Reprinted from [51] with permission from Macmillan Publishers 
Ltd: Nature Rev Cancer, copyright 2008. 







Figure 8.4.6. Comparative imaging study in dogs with sinonasal tumors investigating the spatiotemporal 
stability of Copper(II)-diacetyl-bis(N4-methylthiosemicarbazone) (Cu-ATSM) and 3’-deoxy-3’- 18 F- 
fluorothymidine (FLT) positron emission tomography distributions in during radiation therapy. (Upper 
panel) Sagittal positron emission tomography/computed tomography (PET/CT) slices are shown from a dog 
with sinonasal carcinoma pretreatment (pre) and midtreatment (mid) with intensity modulated radiation therapy. 
Cu-ATSM (middle) and FLT (bottom) scans demonstrate stable spatial distributions of both radiotracers during 
therapy. (Lower panel) Voxel-based scatter plots comparing pretreatment (pre) and midtreatment (mid) Cu- 
ATSM and FLT standardized uptake value (SUV) distributions and Spearman rank correlation coefficients 
(upper left) for dogs with nasal carcinoma or sarcoma. Spatial distributions and uptake of dose painting targets 
Cu-ATSM and FLT remain stable through mid-treatment, regardless of histology. Reprinted from [234] with 
permission from Elsevier. 
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Tables 



Table 8.4.1 Proportional Mortality due to tumors/neoplasm over Years by Breed. Due to the genetic consequences of 
pure-breeding, many dog breeds have inherited very high incidence rates of specific cancer types. Here, we look at the 
proportional mortality for the development of a neoplasm in different breeds during different years as reported. 
Proportional Mortality calculated as number or deaths due to tumors and/or neoplasm /total deaths in breed x 100. Data 
taken from [63, 170, 307-309]. 


Breed 

1997 ^ 307 -' 

2003 13091 

2005 13081 

2010 1631 

2011 11701 

Bernese Mountain dog 

32.6 

34.2 

41 

45.7 


Boxer 

29.6 

11.6 

37 

38.5 

44.3 

Cocker Spaniel 

22.2 

14.6 


29.4 

27.7 

Dachshund 


17.3 

8 

16.7 

8.9 

Doberman 

22.2 

12.9 

23 

26 

26 

German Shepherd dog 

14.8 

9.6 

16 



Golden retriever 


20.3 

30 

38.8 

49.9 

Great Dane 

12.3 


13 


22 

Greyhound 

12.3 


11 


21.6 

Irish wolfhound 

24.8 


22 


31.8 

Labrador retriever 


13.1 

21 

31.2 

34 

Newfoundland 

16.8 

5.6 

14 

27.1 

1.99 

Poodle 


8.5-16.3 

10 


1.1-2.7 

St. Bernard 

13.1 

5.7 

19 


26.9 

Total Dogs Investigated 

222,000 

2,928 

350,000 

15,881 

74,556 



Table 8.4.2: Breed cancer specific mortality for OS and mammary tumors. Listed are the mortality from tumors per breed, the rank among 
breeds for the development of OS and mammary tumors, the interquartile age for longevity, the AKC height, and breed standard weights. Note the 
trend for OS, the higher the height/weight the more likely the breeds are the develop OS. 


Breed 

Mortality 
from Tumors 
per 10,000 
dog-years at 
risk 13083 

Risk Rank 
Mammary 
Tumors [310] 

Incidence Rate 
for Mammary 
Tumors [310] 

Incident 

Rate Bone 
Tumors [311] 

Breed 

Bone 

Tumor 

Rank 33113 

Interquartil 
e Range of 
Ages 32803 

Size (kg) 33123 

AKC Height 
(Inches) At 
Shoulders 33133 

American cocker 
spaniel 


5 

192 





13.5-15.5 

Beagle 


29 

95 




10 

13-15 

Bearded Collie 


45 

50 



12.2-14.3 


20-22 

Belgian Tervuren 


42 

51 




32 

22-26 

Bernese Mountain 

Dog 

306 

41 

54 

26 

8 


45 

23-27.5 

Bichon Frise 


8 

172 



9.5-14.8 


9.5-11.5 

Border Collie 


43 

51 



11.5-15 

17 

18-22 

Border Terrier 


24 

116 



8.9-13.1 

6 

11.5-15.5 

Boxer 

203 

3 

256 

13 

12 

7-11.6 

29 

21.5-25 

Cairn Terrier 


3 

148 



10.6-15.5 


10 

Cavalier King Charles 
Spaniel 

24 

38 

64 



8.1-12.3 

6 

12-13 




Dachshund 
(smooth/wired, 
normal size) 

21 

27 

104 



9.2-14.3 

(Miniature) 

7 


Dalmatian 


36 

72 



11.5-14.0 

25 

19-23 

Doberman Pinscher 

168 

2 

297 

24 

10 

6.2-11.0 

35 

24-28 

Drever 

29 

39 

62 






English Cocker 

Spaniel 


13 

142 




14 

15-17 

English Springer 
Spaniel 


1 

319 



10.4-14 

23 

19-20 

Finnish Hound 


50 

17 






Finnish Spitz 


47 

40 





15.5-20 

Flat-Coated Retriever 


31 

94 

35 

7 


29 

22-24.5 

German Pointer 


16 

128 





23-25 

German Shepherd dog 

71 

9 

170 



9.2-12.9 

37 

22-26 

Golden Retriever 

55 

35 

73 

6 

20 

11.0-14.09 

32 

21.5-24 

Great Dane 

119 



45 

3 

4.0-9.0 

30 

28-32 

Greyhound 

58 



30 

6 

8.1-12.0 



Hovawart 




28 

9 




Irish setter 


19 

126 

8 

14 




Irish Wolfhound 

296 



99 

1 


54 


Jack Russell Terrier 


26 

105 



9.3-15.7 

7 


Jamthund 


21 

122 








Labrador Retriever 

45 

28 

96 

Leonberger 

197 

7 

176 

Lhasa Apso 




Miniature Dachshund 

6 

20 

126 

Miniature Schnauzer 


33 

77 

Miniature/toy poodle 


15 

136 

Mongrel 

34 

25 

116 

Munsterlander 


30 

95 

Newfoundland 

105 

49 

21 

Norwegian Elkhound 


32 

80 

Nova Scotia Duck 
Tolling Retriever 


17 

126 

Papillion 


10 

158 

Petite basset Griffon 


18 

126 

Pyrenees 

108 



Rottweiler 


22 

121 

Rough-haired Collie 


51 

5 

Saint Bernard 

172 



Samoyed 


37 

72 

Shetland Sheepdog 


46 

47 

Shih-tzu 


34 

74 


10.6-14.0 


30 







53 

4 






7.7-15.3 















11.1-15.6 


Miniature:>10 
<15; Toy: <10 











22 

11 


64 






19.5-20.5 




20 

17-21 





8-11 





13-15 





25-32 

36 

5 

5.5-10.2 

45 

22-27 



9.4-13.8 


22-26 

78 

2 



25.5.-27.5 





19-23.5 



11.7-13.8 


13-16 


9.2-15.6 


8-11 





Soft Coated Wheaten 

Terrier 


4 

199 





17-19 

Springer Spaniel 

44 







19-20 

Standard Poodle 

18 

23 

120 




25 

>15 

Standard Schnauzer 


14 

142 

9 

13 



17.5-19.5 

Swedish Hound 


44 

50 





11.5-13.5 

Tibetan Spaniel 


48 

29 





10 

Watchel dog 


11 

149 






Weimaraner 






11.1-13.5 

28 

23-27 

West Highland White 
Terrier 


40 

60 



10.4-14.9 


10-11 

Yorkshire Terrier 


6 

188 



10.0-15.1 

3 



Table 8.4.3: Overrepresentation of specific cancers in specific breeds 


Cancer 

Breed 

Odds Ratio (OR) 

Relative Risk 

(RR) 

Reference 

Anal Sac Gland Carcinoma 


Cavalier King Charles Spaniel 

3.36 


[314] 


English cocker spaniel 

11 


[314] 


English springer spaniel 

3.65 


[314] 


German Shepherd Dogs 



[315], [316] 


Golden Retrievers 



[315], [316] 



Fibrosarcoma 


Basset Hound 


2.9 

[317] 


Border Collie 



[318] 


Doberman Pinscher 


2.1 

[317] 


English Setter 


2.4 

[317] 


German Shepherd 


1.5 

[317] 


Great Dane 


2.6 

[317] 


Golden Retriever 

3.64 


[318] 


Labrador Retriever 


1.8 

[317] 








Saint Bernard 


3.7 


[317] 


Gastric Carcinoma 


Belgian Shepherd 



[319], [320] 


Bouvier des Flandres 

36.5**** 


[321] 


Chow-Chow 


46.2 

[321], [322], [323] 


Collie/Rough Collie 

26 i**** 


[319], [321] 


Groenendael (Belgian Shepherd 
Dog) 

34 5 **** 


[321] 


Norwegian Elkhound 

6 i**** 


[321] 


Norwegian Lundehund 



[324],[325] 


Staffordshire Bull Terrier 



[319], [323] 


Standard poodle 

2 6 **** 


[321] 


Tervuren 

56 i**** 


[324], [326] 


Hemangiosarcoma 

Border Collie ^ I [318] 


Boxer 

>4 


[327], [328], [329], 
[330] 

German Shepherd 

>3 


[327], [331], [332] 


Golden Retrievers 


6.15** 


[333], [334] 
























Lymphoma 


Airedale Terrier 


2.7 

[317] 

Basset Hounds 


4.1 

[317] 

Beagle 


2.57 

[339] 

Border Collie 

1.32 


[340] 

Bouvier des Flandres 


2.31 

[328], [339] 

Boxers 

3.26 


[340], [341] 

Bull Mastiff 

6.76 


[340], [342] 


Bulldog 


4.73 


[328], [340] 













Cocker Spaniels 


1.15 

Dobermans 

2.75 


English Springer Spaniel 

1.27 


German Shepherds 

2.03 


Golden Retriever 

2.35 


Irish Setter 

1.18 


Labrador retriever 


1.7 

Miniature schnauzer 

1.57 


Old English Sheepdog 

2.24 


Rottweiler 

6.01 


Saint Bernard 


2.63 

Scottish Terrier 


9.16 

Staffordshire bullterrier 

1.25 




Malignant Histiocytosis/ Histiocytic sarcoma 


Bernese Mountain Dog 

45 


Flat-Coated Retriever 

62 


Golden Retriever 



Labrador Retriever 




Pembroke Welsh Corgis 


9.7 


[339] , [343] 

[341], [340] 

[340] 

[328], [341] 

[340], [344] 

[340] 

[317] 

[340] 

[339] , [340] 

[340] , [341] 

[339] 

[328], [339] 

[340] 

[345], [346], [347], 
[348] 

[345], [349] 

[350] 

[350] 

[351] , [352] 





























Rottweiler 





Mast Cell Tumors 


American Staffordshire 
Beagle 

Boston Terrier 

Boxer 

Bullmastiff 

Chinese Shar-Pei 

Dutch Pug 

English Pointer 

English Setter 

Fox Terrier 

Golden Retrievers 

Labrador Retriever 

Rhodesian Ridgeback 

Staffordshire Bull Terrier 

Vizsla 

Weimaraner 


4.21 

10.24 

3.84 
3.41 

1.84 
1.88 


[328] 

[353] 

[328], [354], [355] 
[328], [354] 

[328] 

[328] 

[328] 

[328] 

[328] 

[353], [355] 

[328], [344] 

[328] 

[328] 

[355] 

[328], [338] 

[328] 


Melanoma 







Airedale Terrier 


3.41 


[328] 


Beauce shepherd 

2.38 


[77] 

Boxer 

4.35 


[328] 

Chesapeake Bay Retriever 

5.84 


[328] 

Chinese Shar-Pei 

24.43 


[318] 

Chow-Chows 

40.37 


[318] 

Cocker Spaniel 


1.65 

[356] 

Doberman Pinscher 

3.02 


[328] 

German Shorthaired Pointer 


2.5 

[317] 

Golden Retriever 

2.59 


[328], [344] 

Gordon Setter 


5.17 

[356] 

Irish Setter 

2.23 


[328] 

Labrador retriever 

1.72 


[77] 

Miniature Schnauzer 

7.53 


[328] 

Poodle 

1.63 


[77] 

Rottweiler 

1.17 


[77] 

Schnauzer 

1.93 


[77] 

Scottish Terrier 

3.07 


[328], [77] 

Vizsla 

17.34 


[328] 


Osteosarcoma 




Beagle 

3.8 


[357] 

BMD 



[311] 

Boxer 



[311] 

Doberman Pinscher 

2.3 


[311], [17] 

Flat-Coated Retrievers 



[345], [311] 

German Shepherd Dog 

2.2 


[357] 

Golden retriever 

2.1 


[311], [17] 

Great Dane 

12 


[311], [358] 

Greyhound 

17.3 


[311], [42] 

Hovawart 



[311] 

Irish setter 

3.5 


[311], [17] 

Irish Wolfhound 

20.7 


[311], [17] 

Labrador Retriever 

1.3 


[17] 

Leonberger 



[311], [359] 

Miniature Poodle 

2.7 


[357] 

Newfoundland 



[311], [359] 

Rottweiler 

14.6 


[311], [358] 

Saint Bernard 

11.9 


[311], [17] 

Scottish Deerhound 



[300] 


Standard schnauzer 


[311] 


































Prostate Cancer 


Airedale Terrier 

2.46 

Beagle 

1.49 

Bernese Mountain Dog 

2.41 

Bouveir des Flandres 

5.51 

Doberman Pinscher 

1.97 

German Shepherd Dog 

2.6 

German Shorthaired Pointer 

1.89 

Norwegian Elkhound 

3.28 

Scottish Terriers 

3.81 

Shetland Sheepdogs 

1.82 


[ 360 ] 

[17], [360] 
1361] 

"[361] 

[360] 

"[357] 

[360], [361] 
[360] 

[360], [361] 
[360] 


Soft-Tissue Sarcoma 


Boxer 

Golden Retrievers 
Labrador Retriever 
Rhodesian Ridgeback 
Siberian Husky 


[328] 

[344], [328] 
[328] 

[328] 

[328] 


Squamous cell Carcinoma 






Dalmatian 

6.94 


[328] 

Labrador Retriever 

2.41 


[318] 

Poodles 

4.61 


[318] 

Samoyeds 

24.63 


[318] 

Scottish terrier 


2.5 

[317] 

Weimaraner 


2.5 

[317] 


Squamous Cell Carcinoma of the Digit 


Beauceron 



[362] 

Black Standard Poodle 

5.9 


[363], [299] 

Briard 

10.4 


[362], [299] 

Dachshund 



[364] 

Flat-Coated Retriever 



[364] 

Giant Schnauzer 

22.7 


[299] 

Gordon Setter 

11.1 


[299] 

Kerry Blue Terrier 

7.7 


[299] 

Labrador Retrievers 



[363] 

Rottweiler 



[364] 



Transitional Cell Carcinoma of the Bladder 
































Beagle 

4.15 


[365] 


Scottish terriers 

18.9 


[365] 


Shetland Sheepdogs 

4.46 


[366] 


West Highland White Terriers 

3.02 


[365] 


Wire-haired Fox Terriers 

3.2 


[365] 


*Female gonadectomized all ages 

**Scrotal 

***Non-scrotal 

****The PMR of gastric carcinoma by breed was calculated by dividing the number of gastric carcinomas in a breed by all tumors in 
the breed over the number of gastric carcinomas in all other breeds divided by all other tumors in the other breeds in the database 
[321]. 




Table 8.4.4 Comparative aspects of spontaneous cancers in humans and dogs 


Osteosarcoma* 


Incidence or Prevalence Histology Biological Behavior Treatment Shared Molecular and Genetic Factors 

Risk Factors 


Human • 1000/year 

• Adolescent disease 
(peak onset 10-14 years) 


• 85-95% high grade • 90% in the appendicular 

skeleton 

• Marked aneuploidy and 

karyotypic complexity • Metaphyseal region of long 

bones (distal femur > proximal 
tibia > proximal humerus) 

• 85-90% clinically confined to 
primary site at presentation; 85- 
90% develop metastasis before 
2 years without chemotherapy 
(lung > bone > soft tissues; 
regional lymph nodes <10%) 


• Surgical amputation, limb- 
sparing procedures, or 
rotationplasty 

• Adjuvant multi-agent 
chemotherapy: doxorubicin, 
cisplatin, methotrexate 

• 5-year survival of 60-70% 
(nonmetastatic disease setting) 
or 10-30% (if metastasis found 
at initial diagnosis) 


Canine • >10,000/year 

• Middle-aged to older 
dogs (peak onset 7-9 
years) 


• 95% high grade 

• Marked aneuploidy and 
karyotypic complexity 


•75% in the appendicular 
skeleton 

• Metaphyseal region of long 
bones (distal radius > proximal 
humerus > distal femur) 

• 85-90% clinically confined to 
primary site at presentation; 
90% develop metastasis before 
1 years without chemotherapy 
(lung > bone > soft tissues; 
regional lymph nodes 4.4%) 


• Surgical amputation, limb- 
sparing procedures 

• Adjuvant therapy: Platinum- 
based (carboplatin, cisplatin) 
chemotherapy alone or in 
combination with doxorubicin 

• >50% of dogs do not live 
beyond 1 year postamputation; 
90% die of disease by 2 years 


• Increased inherited risk 
in Scottish Deerhounds, 
Rottweilers, greyhounds, 
Great Danes, Saint 
Bernards, Irish 
wolfhounds 


• p53 gene mutations and loss of 
heterozygosity, p53 overexpression 

• RBI copy number loss, reduced RB1 
protein expression 

• PTEN gene deletion, decreased PTEN 
expression 

• MYC copy number gain 

• IGF-1/IGF-1R activation enhances 
anchorage independent growth and 
invasion in OS cell lines 

• Constitutive activation of STAT3 
associated with aggressive biological 
behavior 

• Abberant MET expression enhances 
migration in response to ligand (HGF); 
coexpression and heterodimerization of 
MET, EGFR, and Ron alters signal 
transduction and promotes resistance to 
targeted therapeutics 

• mTOR pathway activation enhances 
survival in OS cell lines 

• High ezrin expression associated with 
early metastasis and poor outcome 


^References: [7, 10, 28, 52, 58, 59, 65, 73, 74, 79, 80, 83, 149, 150, 163, 165, 190, 193, 194, 198, 262, 294, 296, 367] 



Non-Hodgkins Lymphomaf 


Incidence or Prevalence Histology Biological Behavior Treatment Shared Molecular and Genetic Factors 

Risk Factors 


Human • 19.6/100,000 


• Histologically • Classified according to the 

classified according to WHO clinical staging system 
the NCI Working 

Formulation * Nodal or extranodal at 

presentation 

• DLBCL (high grade) 
most common (30-44%) 

• Follicular (low grade) 
next most common 
( 22 . 1 %) 


• CHOP-like chemotherapy 
and rituximab (R-CHOP) 

• Involved field and/or total 
body irradiation 


• Gene expression profiling supports the 
organization of canine and human DLBCL 
into molecular subtypes, activated B-cell 
(ABC) DLBCL and germinal center-like 
B-cell (GCB) DLBCL 


• 5 year overall survival of 50 • NF-xB/p65 canonical pathway activation 

65% for DLBCL patients j n dlbCL 

receiving R-CHOP 

chemotherapy • Upregulation of miR-17-92 cluster 


• PTEN gene deletion (human GCB 
DLBCL, canine DLBCL cell lines) 


Canine • 20-107/100,000 

• Higher incidence in 
Boxers, bull mastiffs, 
basset hounds, St. 
Bernards, Scottish 
terriers, Golden 
Retrievers, Airedales, 
bulldogs 


• p53 gene mutations 


• Histologically 
classified according to 
the NCI Working 
Formulation 

• DLBCL (high grade) 
most common (36-58%) 

• 35-40% are T-cell 
malignancies 


• Classified according to the 
modified WHO clinical staging 
system for domestic animals 

•Multicentric nodal 
presentation most common 


• CHOP-based chemotherapy 

• Complete remission rates of 
60-90% 

• Median survival 10-14 
months for DLBCL; median 
survival 6-9 months for 
peripheral T-cell (high grade) 
malignancies 


• MYC copy number gain and protein 
overexpression 

• INK4A-ARF gene deletion in human 
ABC DLBCL; deletion of CFA 11 
(harboring INK4A locus) in canine high 
grade T-cell lymphoma 

• MYC-IgH translocations present in 
Burkitf s lymphoma 


fReferences: [7, 10, 28, 53, 65, 78, 82, 84, 85, 95, 98, 99, 101, 103, 104, 107, 109, 110, 112, 117, 121, 125, 126, 190-192, 213] 



Bladder Cancerf f 


Incidence or Prevalence Histology Biological Behavior 

Risk Factors 


Human • >65,000/year • Graded according to • Variable initial tumor location 


•2:1 ratio for the 
occurrence of bladder 
TCC in men versus 
women 


the WHO/ISUP 
classification of human 
urothelial neoplasms 

• >65% low-grade 


within the bladder 

• 50% of invasive TCC 
metastasize to regional lymph 
nodes, lungs, bone, other 


noninfiltrative TCC organs 


• 20% high-grade 
invasive TCC 


Canine • 2% of canine 
malignancies 

• 2:1 ratio for the 
occurrence of TCC in 


• Classified according to 
modified WHO/ISUP 
classification system for 
dogs 


female versus male dogs • >90% infiltrative TCC 


• Higher incidence in 
Scottish terriers, West 
Highland white terriers, 
Shetland sheepdogs, 
beagles, wire hair fox 
terriers 

• Risk associated with 
obesity, insecticide and 
herbicide exposure 


• <10% non-infiltrative 
tumors 


• Demonstrated risk associated 
with obesity, insecticide and 
herbicide exposure 

• Majority of canine TCC 
located in trigone region of the 
bladder with urethral (56%) 
and/or prostatic involvement 
(29%) 

• Metastasis present in 15-20% 
of dogs at diagnosis; 50% of 
dogs at death (regional lymph 
nodes; lung, bone, other organs, 
including the skin) 


ffReferences: [7, 10, 28, 65, 67, 69, 72, 75, 213, 225, 365, 366, 368-370] 


Treatment 


Shared Molecular and Genetic Factors 


• Low-grade TCC: • Reduced expression of androgen receptor 

transurethral resection and in higher-grade and -stage disease; high 

intravesical therapy can be androgen receptor expression associated 

curative with less-aggressive forms of TCC 


• High-grade TCC: 
cystectomy, adjuvant 
chemotherapy (cisplatin, 
methotrexate, vinblastine, 
doxorubicin) 

• 5-year survival rates 78% 
(muscle-invasive lymph node¬ 
negative disease), 47% 
(extravesical lymph node¬ 
negative tumors) or 31 % 
(lymph node-positive disease) 


• EGFR transcript and protein 
overexpression 

• p53 overexpression 

• Reduced or absent RB1 protein 
expression 

• Cyclooxygenase-2 overexpressed in 
invasive TCC and carcinoma in situ 

• High survivin expression and nuclear 
localization in TCC tumors 


• Coarse fraction radiotherapy, • Telomerase activity detected in human 

intensity-modulated TCC tumors and in urine samples from 

radiotherapy dogs with TCC 

• Adjuvant chemotherapy: • Overexpression of DNMT1 associated 

cyclooxygenase inhibitors, with more aggressive disease 

platinum agents (carboplatin, 

cisplatin), vinblastine 


• Median survival of 4-7 
months with single-agent drug 
treatment; >8 months with 
multi-drug treatments 




CHOP, cyclophosphamide, doxorubicin, vincristine, and prednisone; DLBCL, diffuse large B-cell lymphoma; DNMT1, DNA methyltransferasel; EGFR, epidermal growth factor 
receptor; HGF, hepatocyte growth factor; IGF-1, insulin-like growth factor-1; IGF-1R, IGF-1 receptor; INK4A-ARF, cyclin-dependent kinase inhibitor 2A; ISUP, International 
Society of Urologic Pathology; mTOR, mammalian target of rapamycin; NCI, National Cancer Institute; NFkB, nuclear factor kappa-B; PTEN, phosphatase and tensin homolog; 
RBI, retinoblastoma 1; STAT3, signal transducer and activator of transcription 3; TCC, transitional cell carcinoma; WHO, World Health Organization. 
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1. What is the meaning of life? 






2. Evolution of emotions: origins 

If procreation is the meaning of life, 

our goal is survival until having babies, •• 
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How important is vision? 



> 6 of 38 phyla on earth have eyes 

> Those 6 represent 95% of all living animals 
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3. Why and how to study emotion: what is it? 

Emotion : 

s conscious mental reaction (as anger or fear) 
s subjectively experienced as strong feeling 
s usually directed toward a specific object 
s typically accompanied by physiological and 
behavioral changes in the body 






Idea of emotions in ancient Greece 


The conscious mind is the charioteer that 
tames the wild beasts that are emotions 

- Plato (428-348 BC) 









Plutchik's wheel of emotions (1980) 






























> Heart beats faster & blood pressure go up (more blood to the 


muscles, heart, etc.) 

> Breathing rate increases & small airways in the lungs open 
wider for more oxygen 

> Extra oxygen goes to the brain, increasing alertness 

> Sight, hearing, and other senses become sharper 

> Increase of blood sugar and fats for energy 














Why study emotions like fear? 


Anxiety (fear w/o external stimulus) 

■ Panic 

■ Phobias 

■ Post-traumatic stress disorder 

■ Obsessive-compulsive disorder 

■ Generalized anxiety 




How are emotions like fear studied? 


■ Brain lesions (surgical in animals; 
accidental/stroke/cancer in humans) 

■ Brain stimulation 

■ Brain imaging 

■ Physiology & biochemistry 



Little is known about genetics of emotions 


■ Difficult due to complex genetics 

(humans are too diverse) 

■ Could reveal biochemical pathways 

for therapeutic targeting 
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Ongoing studies to map dog behaviors 

Data 1: dog owner survey of behavior 
across breeds (currently >40,000; 

http://vetapps.vet.upenn.edu/cbarq) 


Data 2: genetic marker data for individual 
pure breed dogs (-1,200, >30 breeds) 










Results 1: Principal components analysis 


Component Pattern 
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> Subset of fear & aggression cluster apart from other behaviors 

> (A) Reanalysis of only fear & aggression traits suggests dog 
and stranger-directed aggression are related to fear of 
strangers ... and these are distinct from owner-directed aggr. 











-LoglO(P-value) O -LoglO(P-value) 


Results 2: Genetic mapping of aggression 


A 


Stranger-directed aggression 




Owner-directed aggression 


chrl5:44,258,017 



> Consistent with PCA, stranger and dog-directed aggression 
are genetically similar... and distinct from other aggression 
























Results 2: Genetic mapping of fear 



> Stranger & dog-directed fear are genetically similar to each 
other and to stranger & dog aggression 

> Only those fear traits share both genome regions (vs. 3 others 
that have chrlO and either chr18 or chrX regions) 




















Results 3: precise mapping of genes 
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GNAT3 is very biologically-relevant (1) 
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GNAT3 is very biologically-relevant (2) 



C. Stangor, Beginning Psychology (v. 1.0) 
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IGSF1 is very biologically-relevant 



> ChrX IGSF1 is expressed predominantly in pituitary & hypothalamus 

> Human IGSF1 mutation results in thyroid hormone deficiency 





































Frequency of variants & domestication 



> Wolves seem to have high frequencies of fear-aggression variants 

> Dogs have low frequencies 




Conclusions 


1. The importance of emotions is underappreciated 

2. Many aspects of life appeal to emotions: art, 
politics, advertising , etc. 

3. We may often act irrationally due to emotions; 
awareness of this may mitigate damage 

4. Fear & anxiety are major human problems 

5. Dogs offer a genetic model of fear and aggression 
which may sometimes be the same thing 
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Outline 


1. Introduction 

2. DNA copy number variation (CNV) 

3. Genetic mapping of cancer risk 



The Genome Era : historical context 

• 2000 Genome Sequencing 

• 2005 Evolution in Action 

• 2007 Human Genetic Variation 



Genetics as a handle on biology & disease 
























Why study CNV in dogs? (1) 


Genetics 


> 15,000 YA 


150-200 YA 



450 inherited diseases 
Dog genome similar to human 
5-7X shorter lifespan v. humans 
75M dogs in USA 


• Intra-breed genetic variation low 

• Inter-breed high (27 v 5-10% hum) 
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Vonholdt etal, Nature. 2010 Mar 17. 












Why dogs? (2) 

Variation 

(morphological, physiological & behaviora 



65X difference sm/largest breeds (wt.) 
-250X smallest/largest individuals 
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Dog genome sequence & tools 



Functional, variation: Custom 1,000,000 feature 
microarray (Agilent CGH) 




Genetic markers: 175,000 feature microarray 

(lllumina SNP) 



Emerging: High-throughput sequencing 
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2A. Identifying copy number 
rnutations in cancer cells 
(in somatic cell DNA) 


DNA copy number in cancer cells 
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2B. Identifying inherited 
functional variation (in 
inherited germ line DNA) 



DNA copy no. variation (CNV) 


Copy number alteration >50 bp 
- germ line polymorphisms and mutations 
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CNV detection and effects 



Gene expression effects 


DNA structural variation 


Protein 

expression effect 


.A. 


e> 


Inversion 


Inversion 


Duplication 

Duplication 
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Alvarez & Akey, Mamm Genome 2012 












































Biological significance: rapid 
mutation mechanism in evolution 



Liberies et al. Naturwissenschaften doi:10.1007/s00114-008-0446-0 (2008) 


• Bats that feed on mammalian blood have 
4 copies of a Plasminogen Activator gene 

• Others, which feed on bird blood, have 1 copy 





CNV in humans 

• >>5000 CNVs described in humans 

• ~800 CNVs/individual; ~150 affect genes 

• Old, recurrent, and de novo events 

• Overrepresented biology: senses, immunity, 
metabolism, microRNA, disease susceptibility 

• Several disease-associations found 


Inter-individual variation -0.5% (by base pairs) 
(vs. single nucleotide polymorphisms/SNPs, 0.1%) 



CNV mutations and dog traits, e.g.: 


* 
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* 

* 
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Furnishings (mustache and eyebrows) 


Australian Terrier, Briard, Brussels Griffon, Cairn Terrier, Giant Schnauzer, Glen of 
Imaal Terrier, Havanese, Irish Wolfhound, Norwich Terrier, Old English Sheepdog, 
Portuguese Water Dog, Scottish Deerhound, Scottish Terrier, Shih Tzu, Standard 
Poodle, Standard Schnauzer, Toy Poodle, West Highland White Terrier, Yorkshire 
Terrier 


167 bp insertion 


Globoid cell leukodystrophy 


Irish setter 


78 bp insertion 


Cone-rod dystrophy, "locus 1" 


Miniature Longhaired Dachshund 


44 bp insertion 


Chondrodysplasia 


Trait (listed in order of variant size, SINE/LINE 

mechanism, CNVs with unknown effects) _ 

Gonadal dysgenesis, X-chromosomal monosomy 
(presence of single sex chromosome, X) 


Hemophilia A 


Skeletal myopathy and dilated cardiomyopathy 
(Duchenne muscular dystrophy) _ 


Hemophilia, "hemophilia B" 

Cone degeneration _ 

Copper toxicosis _ 


Hemophilia, "hemophilia B", factor IX deficiency 
Cone-rod dystrophy, "locus 3" 

Paroxysmal hypertonicity, "episodic falling 

syndrome (EFS)" _ 

Collie eye anomaly 


Oculoskeletal dysplasia 
Cone-rod dystrophy 


Hair ridge ("Ridgeback") and predisposition to 
dermoid sinus 


Basset Hound, Cairn Terrier, Cardigan Welsh Corgi, Chihuahua, Dachshund, 
Dandie Dinmont Terrier, Glen of Imaal Terrier, Grand Basset Griffon Vendeen, 
Havanese, Japanese Chin, Lancashire Heeler, Norwich Terrier, Pekingese, 
Pembroke Welsh Corgi, Petit Basset Griffon Vendeen, Scottish Terrier, Shih Tzu, 
Skye Terrier, Swedish Valhund, Tibetan Spaniel, West Highland White Terrier, 
Yorkshire Terrier 


5 kb insertion; retrotransposition of FGF4 
retrogene 


Loss of sex chromosome (normal complement of 
autosomes and a single sex chromosome, X, or 

77,XO) _ 

Large inversion _ 

> 2.4 Mb deletion 


Deletion 


>140 kb deletion 


39.7 kb deletion 


>30 kb deletion 


>20 kb deletion 


15.7kb deletion 


7.8 kb deletion 


1,267 bp deletion 
180 bp deletion 
133 kb duplication 


Alvarez & Akey, Mamm Genome 2012 


DNA structural / copy no. variation 
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aCGH of normal dogs 


Candidate breed-specific CNVs 
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CNV in CSMD1 of Rottweilers 


• CSMD1 frequently deleted in epithelial tumors 

• Somatic point mut. & allele loss associated with cancer 

• Deletion and reduced expression assoc, w/ poor outcome 


















Dog CNV: numbers & affected genes 


Genome Research 2009 

Resource 

Mapping DNA structural variation in dogs 

Wei-Kang Chen , 1,4 Joshua D. Swartz , 1,4,5 Laura J. Rush , 2 and Carlos E. Alvarez 1,3,6 

1 Center for Molecular and Human Genetics, The Research Institute at Nationwide Children's Hospital f, Columbus, 

Ohio 43205, USA; 2 Department of Veterinary Biosciences; The Ohio State University, Columbus; Ohio 43210, USA; 

3 Department of Pediatrics, The Ohio State University College of Medicine, Columbus, Ohio 43210, USA 


• Identified -200 CNVs (80 CNVrs); 75 genes 

• Dogs/rodents/humans have similar numbers 

• Several human disease genes affected 

• Similar gene functions are over-represented 
(e.g., immunity and olfaction) 







Breeds of interest 


Driven by veterinary investigators 


• Cavalier King Charles Spaniel 

• German Shorthaired Pointer 

• Labrador Retriever 

• Doberman Pinscher 

• Great Dane 

• Shar-Pei 

• Shih-Tzu 

• Cocker Spaniel 

• German Shepherd 

• Beagle 

• Border Collie 

• Yorkshire Terrier 

• Boxer 

• Bulldog 

• Greyhound 
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Functional variation: coat color 
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Yellow Great Dane 
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Schmutz & Berryere (2007) Animal Genetics 38:539-549 


















































Dominant Black dominance order 
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Kerns et al. (2007) Genetics 176:1679. 





Brindle is similar to Blaschko's lines 


Pigmentary mosaicism (not mutation, 


but effect of two genetic cell types) 



Brindle: 

Suggestive of a DNA mutation 
with an epigenetic effect 

Kerns et al. (2007) Genetics 176:1679. 









Golden Retriever 






















Characterization of K Black alleles 

























Brindle due to CBD103 expression 
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DNA methylation at CTCF locus 
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Brindle may be like X-inactivation 



Chao et al. (2002) Science 295: 345-347. 

























3. Mapping complex 
disease traits: 
osteosarcoma risk 



Greyhounds 

• European nobility used for coursing 

• AKC show, since 1885 

• Racer registration, since 1906 

• Increased risk for osteosarcoma in racer (17X RR) 



Show Racer 









Description 

Polyneuropathy 

Juvenile dogs show exercise intolerance and walking difficulties. In the later stages, there is severe 
muscle atrophy and ataxia 

Pancreatic acinar atrophy 

Diabetes accompanied by diarrhea 

Thyroid function 

Basal serum thyoxine (T4) and free thyroxine (fT4) concentrations were significantly lower in 
Greyhounds than in non-Greyhounds 

Vitreal degeneration 

Degeneration of the vitreous, often associated with cataract 

Sodium thiopentane sensitivity 

Sensitivity to sodium thiopentane and other barbiturate anaesthetics 

Neoplasia - Osteosarcoma 

Excess of osteosarcoma 

Persistent right aortic arch 

Aortic arch abnormally producing an encircling ring around trachea and oesophagus.clinical sigs of 
regurgitation, aspiration pneumonia and failure to thrive due to oesophageal compression 

Asthma 

Asthma, broncho-constriction - non specific bronchial hyperresponsiveness 

Meningoencephalitis 

Head tilting, ataxia, recumbency, circling, and blindness. Severe inflammatory changes in caudate 
nucleus, cerebellum and brain stem 

Cutaneous vasculopathy 

Skin and less often kidney lesions: hemorrhages, fibrinoid arteritis, thrombosis with deep, slowly 
healing ulcers of skin. Renal glomerular necrosis 

Epilepsy 

Fitting and seizures 

Malignant hyperthermia 

Hyperthermia and rigid paralysis after exercise, stress or exposure to halothane, caffeine . 

Sensorineural deafness 

Deafness starting at roughly 4 weeks 

Type 1 von Willebrand disease 

Blood clotting disorder 

Progressive Retinal Atrophy 

Loss of night vision progressing to total blindness 

Hindlimb lameness 

Lameness caused by avulsion of the tibial tuberosity 

Chronic superficial keratitis of 
the cornea 

Bilateral disease of the cornea -vascularised pigmented subepithelial growth 































Other Greyhound traits 



Running performance 




Single nucleotide 

polymorphisms (SNPs) 


. ACCTGCTATTTCGATT. 


. ACCTGCTAGTTCGATT. 


Genetic markers for mapping (usu. not causative mutations) 



lllumina SNP array: 

• ~175,000 SNPs (=1 SNPs/14,000 bp) 

• Avg call rate: 99.8%; Reproducibil: >99.9% 










Racers Show 
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Grouping dogs by OSA status 




OSA+/- racers 
group apart from 
shows 


O OSA+ Racer 
O OSA - Racer 
O Show 



• Removing “racer” 
effect, OSA- racer 
and shows group 
together 



























































































Genome wide association (GWA) 


Frequency of ea SNP allele (A or B) is 
measured in cases vs. controls 


SNP microarray: 
175,000 SNPs 




Which SNP alleles over whole genome have 
different frequency in cases vs. controls ? 










































Osteosarcoma genome wide 
association (GWA) & Results 

1. Case control study: 12 OSA+ racers 

12 OSA - racers 
12 OSA - show 

2. Second group of case-control racers for validation 


> Mapped complex genetic OSA risk loci: 15 hits 

> Validated 3 OSA loci in a second group of dogs 
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LRIG3 locus under selection 



Tracking footprints of artificial selection in the 
dog genome 

Joshua M. Aikey 3,1 , Alison L. Ruhe b r Dayna T Akey 3 , Aaron K. Wong b , Caitfin F. Connelly 3 , Jennifer Mad 
Thomas J. Nicholas 3 , and Mark W. Neff b c,d 1 


Table S3. 

Candidate selection 

regions with single-gene resolution 


Chr 

Coordinates, Mb 

Breed 

Gene 

Putative phenotype 

4 

82.12-83.12 

Greyhound 

CDH9 

Behavioral 

5 

80.92-81.92 

Greyhound 

ZFHX3 

Cranial morphology 

7 

51.14-52.14 

Brittany 

PIK3C3 

Behavioral 

9 

11.16-12.06 

Greyhound, Brittany 

50X9 

Skeletal morpholoqy 

ID 

5.19-6.19 

Greyhound 

LRIG3 

Elongated body axis 

18 

26.65-27.65 

Dachshund 

5EMA3D 

Morphology 

21 

17.71-18.71 

Poodle 

DLG2 

Behavioral 

23 

27.43-28.43 

Border Collie 

5AJ81 

Athletic performance 















Conclusions 



• Validated approach of cataloguing CNV in dogs 

• Identified structural variation affecting 100s of 
genes (e.g., Dominant Black in brindle) 

• Mapped and validated osteosarcoma risk loci 

• Racing selection may increase OSA risk 


Next steps & ongoing studies 

Next steps: identifying osteosarcoma-risk 
mutations, human relevance, translation 


Ongoing: 

• Canine lung and soft tissue sarcoma 

• Longitudinal cohorts: cancer risk 
in DoD military dogs 

• Determining canine Hb genetics 

• Mapping behavioral traits 
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